Open-Source LLM with RAG

A demonstration project that shows how to build a hybrid Retrieval-Augmented Generation (RAG) system by combining a base model, accessed through an OpenAI-compatible client (the notebook's examples use an NVIDIA endpoint), with real-time web search via SerpApi. The repository's main artifact is RAG.ipynb, which walks through installing dependencies, wiring up SerpApi, building the hybrid RAG flow, adding a simple chatbot interface, applying caching and fallback strategies, and running example evaluations.

NOTE: This repository contains example notebook code for demonstration and experimentation. Do NOT commit your real API keys to the repo. Always use environment variables or secret managers.

Highlights / Features

- Real-time data search using SerpApi to ground model answers.
- Hybrid RAG pipeline that combines retrieval (SerpApi) and generation (base LLM).
- Basic interactive chatbot interface (Jupyter/Colab).
- Caching for search results to improve speed and reduce quota usage.
- Fallback handling when real-time search fails.
- Notebook-oriented: easy to run in Google Colab or locally.

Files

RAG.ipynb — Main notebook demonstrating setup, real-time search integration, the hybrid RAG system, chatbot, caching, and example tests.

Requirements

Python 3.8+
Recommended pip packages (create requirements.txt or install individually):
- langchain
- langchain-community (or the current package offering SerpAPI integration)
- openai (or the SDK you use to access your base model — the notebook uses an OpenAI-compatible client)
- requests
- jupyter / ipykernel (if running locally)

The notebook also uses functools for caching; it is part of the Python standard library and needs no installation.

Example requirements (put in requirements.txt):

langchain
langchain-community
openai
requests
jupyter
ipykernel

(Adjust package names and versions to match your environment. LangChain integrations move between packages fairly often, so if langchain-community is unavailable under that name, check the current LangChain docs for the package and import path that provide the SerpApi integration.)

Environment variables / Secrets

Before running the notebook, set these environment variables (or configure secrets in Colab):

- SERPAPI_API_KEY: your SerpApi API key (https://serpapi.com/).
- NV_API_KEY, or whichever API key variable your base model client expects (the notebook's examples use an NVIDIA endpoint). If you use OpenAI, set OPENAI_API_KEY instead.

Important: remove any hard-coded API keys from the code and read them from environment variables instead (e.g., os.environ["SERPAPI_API_KEY"]), or use Colab secrets.
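
As a minimal sketch of reading keys from the environment rather than hard-coding them, the helpers below fail early with a readable message when a key is missing. The function names require_env and load_api_keys are hypothetical and are not taken from the notebook; the variable names are the ones listed above.

```python
import os


def require_env(name: str) -> str:
    """Return the value of an environment variable or raise a clear error."""
    value = os.environ.get(name, "").strip()
    if not value:
        raise RuntimeError(
            f"Environment variable {name} is not set; "
            "export it in your shell or add it as a Colab secret."
        )
    return value


def load_api_keys() -> dict:
    """Collect the keys the notebook needs without hard-coding them."""
    # Assumption: NV_API_KEY is tried first, with OPENAI_API_KEY as the alternative.
    model_key = os.environ.get("NV_API_KEY") or os.environ.get("OPENAI_API_KEY")
    if not model_key:
        raise RuntimeError(
            "Set NV_API_KEY or OPENAI_API_KEY for the base model client."
        )
    return {"serpapi": require_env("SERPAPI_API_KEY"), "model": model_key}
```

Calling load_api_keys() once near the top of the notebook surfaces a missing key before any search or model request is made.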

Usage

Google Colab

Open the notebook via the Colab link at the top of RAG.ipynb.

Set your API keys using Colab's UI or at the top of the notebook:

import os
os.environ["SERPAPI_API_KEY"] = "<your_serpapi_key>"
os.environ["OPENAI_API_KEY"] = "<your_openai_key>"  # or NV_API_KEY if applicable

Run cells sequentially. Follow the notebook instructions and remove any example/hard-coded keys first.

Locally (Jupyter)

Clone the repository:

git clone https://github.com/LIKHITHADITHYA/Open-Source-llm-with-RAG-.git
cd Open-Source-llm-with-RAG-

Install dependencies:

python -m pip install -r requirements.txt

Start Jupyter:
```
jupyter notebook
```
Open RAG.ipynb and set environment variables (or use a local .env and load it safely).
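
One way to load a local .env file, sketched with the standard library only. The function name load_dotenv_file is hypothetical, and the parser assumes simple KEY=VALUE lines; variables already set in the real environment are left untouched. Keep the .env file out of version control.

```python
import os
from pathlib import Path


def load_dotenv_file(path: str = ".env") -> int:
    """Load KEY=VALUE pairs from a file into os.environ; return how many were set."""
    env_path = Path(path)
    if not env_path.is_file():
        return 0
    loaded = 0
    for raw_line in env_path.read_text(encoding="utf-8").splitlines():
        line = raw_line.strip()
        # Skip blanks, comments, and lines without an assignment.
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        key = key.strip()
        value = value.strip().strip("'\"")  # drop optional surrounding quotes
        # Variables already present in the environment take precedence.
        if key and key not in os.environ:
            os.environ[key] = value
            loaded += 1
    return loaded
```

If you prefer a maintained library over a hand-rolled parser, the python-dotenv package offers similar behavior.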

Typical workflow in the notebook

- Install and import dependencies (langchain and any wrappers).
- Provide API keys via environment variables (do not hard-code them).
- Instantiate the SerpApi wrapper and a base model client.
- Implement get_realtime_search_results(query), including parsing and optional caching.
- Implement hybrid_rag_system(query), which calls the search function and sends a combined prompt to the model.
- Optionally run an interactive start_chatbot() that uses the hybrid system.
- Evaluate results and tune prompts, caching, and fallback behavior.
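
The workflow above can be sketched roughly as follows. This is not the notebook's code: the search and model calls are passed in as plain callables (search_fn and llm), so the sketch runs without SerpApi or a model endpoint, and make_search, build_prompt, FALLBACK_CONTEXT, and the prompt wording are all illustrative assumptions.

```python
from functools import lru_cache
from typing import Callable

# Hypothetical stand-ins: in the notebook these would wrap SerpApi and the model client.
SearchFn = Callable[[str], str]
LlmFn = Callable[[str], str]

FALLBACK_CONTEXT = "No real-time search results were available."


def make_search(search_fn: SearchFn, maxsize: int = 128) -> SearchFn:
    """Wrap a raw search callable with in-process caching and a fallback."""

    @lru_cache(maxsize=maxsize)
    def cached(query: str) -> str:
        return search_fn(query)

    def get_realtime_search_results(query: str) -> str:
        try:
            # Strip whitespace so trivially different queries share a cache entry.
            return cached(query.strip())
        except Exception:
            # lru_cache does not store exceptions, so a later call retries the search.
            return FALLBACK_CONTEXT

    return get_realtime_search_results


def build_prompt(query: str, context: str) -> str:
    """Combine retrieved context and the user question into one prompt."""
    return (
        "Use the search results below to answer the question. "
        "If they are not relevant, answer from your own knowledge and say so.\n\n"
        f"Search results:\n{context}\n\n"
        f"Question: {query}\nAnswer:"
    )


def hybrid_rag_system(query: str, search: SearchFn, llm: LlmFn) -> str:
    """Retrieve context for the query, then ask the model with a combined prompt."""
    context = search(query)
    return llm(build_prompt(query, context))
```

Passing the search and model functions as arguments keeps the flow easy to test with fakes and makes it straightforward to swap in a different search provider or model client.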

Caching & Robustness

The notebook demonstrates using functools.lru_cache for simple in-process caching. For production or multi-process deployments, consider:

- Redis or Memcached for shared caching.
- A persistent vector store (e.g., FAISS or Milvus) for storing and reusing retrieved context.
- Circuit breakers and retries for API robustness.
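
As an illustration of the retry idea, here is a minimal sketch of retrying a call with exponential backoff. The helper name call_with_retries and its default values are assumptions, not part of the notebook; the sleep function is injectable so the behavior can be checked without waiting.

```python
import time
from typing import Callable, TypeVar

T = TypeVar("T")


def call_with_retries(
    fn: Callable[[], T],
    attempts: int = 3,
    base_delay: float = 0.5,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Call fn, retrying on any exception with exponential backoff."""
    if attempts < 1:
        raise ValueError("attempts must be at least 1")
    for attempt in range(attempts):
        try:
            return fn()
        except Exception:
            if attempt == attempts - 1:
                raise  # out of attempts: surface the last error
            # Delay doubles each time: base_delay, 2 * base_delay, 4 * base_delay, ...
            sleep(base_delay * (2 ** attempt))
    raise AssertionError("unreachable")
```

In practice you would likely narrow the except clause to the network or rate-limit errors your client raises, so that programming errors are not retried.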

Security and Privacy

- Never commit API keys to the repository.
- Sanitize logs and outputs before sharing.
- If you store or index external content, make sure you comply with copyright and privacy policies.
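
One possible way to sanitize logs before sharing them is to mask known key values and api_key query parameters. The sketch below is illustrative only: redact_secrets is a hypothetical helper, the variable names come from the environment-variable section, and the api_key=... pattern is an assumption about how keys might appear in logged request URLs.

```python
import os
import re
from typing import Iterable

# Variable names listed in the "Environment variables / Secrets" section.
SECRET_ENV_VARS = ("SERPAPI_API_KEY", "NV_API_KEY", "OPENAI_API_KEY")


def redact_secrets(text: str, extra_secrets: Iterable[str] = ()) -> str:
    """Replace known secret values and api_key=... parameters with a placeholder."""
    secrets = [os.environ.get(name, "") for name in SECRET_ENV_VARS]
    secrets.extend(extra_secrets)
    for secret in secrets:
        # Skip empty or very short values to avoid mangling ordinary text.
        if len(secret) >= 8:
            text = text.replace(secret, "[REDACTED]")
    # Also mask any api_key query parameter, whatever its value.
    return re.sub(r"(api_key=)[^&\s]+", r"\1[REDACTED]", text)
```

This only catches secrets it knows about, so it complements, rather than replaces, a manual check of notebook outputs before publishing.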

Contributing

Contributions are welcome. Suggested contribution workflow:

- Fork the repo.
- Create a branch with your change: git checkout -b feature/your-change
- Commit and push: git commit -am "Add ..." && git push origin feature/your-change
- Open a Pull Request describing your changes.

Example: Add README to the repo locally

If you want to add this README file to the repository locally and push it:

# from the repository root
git checkout -b docs/add-readme
# create README.md with content above (save it)
git add README.md
git commit -m "Add README with usage and setup instructions"
git push origin docs/add-readme
# Open a Pull Request on GitHub from the 'docs/add-readme' branch

Troubleshooting

- ModuleNotFoundError: if the langchain_community import fails, ensure the right package and version are installed and check the import path; langchain extensions change frequently.
- If SerpApi returns unexpected formats, inspect raw responses and adapt the parsing logic accordingly.
- If the base model API returns streaming output or different response shapes, adjust extraction code to match the client SDK.
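
When adapting the parsing logic, it can help to tolerate several response shapes instead of indexing into one fixed structure. The sketch below assumes the search response is either a plain string or a dict with optional answer_box and organic_results entries, each result carrying a snippet field; verify these field names against the raw responses you actually receive. The function name extract_snippets is hypothetical.

```python
from typing import Any, List


def extract_snippets(response: Any, limit: int = 3) -> List[str]:
    """Pull up to `limit` text snippets out of a search response of uncertain shape."""
    # Some wrappers return a plain string instead of structured JSON.
    if isinstance(response, str):
        return [response] if response.strip() else []
    if not isinstance(response, dict):
        return []

    snippets: List[str] = []

    # A direct answer, when present, goes first.
    answer_box = response.get("answer_box")
    if isinstance(answer_box, dict):
        answer = answer_box.get("answer") or answer_box.get("snippet")
        if isinstance(answer, str):
            snippets.append(answer)

    # Then regular results; entries without a text snippet are skipped.
    for item in response.get("organic_results") or []:
        if isinstance(item, dict) and isinstance(item.get("snippet"), str):
            snippets.append(item["snippet"])

    return snippets[:limit]
```

Returning an empty list for unknown shapes lets the caller fall back to a model-only answer instead of raising a KeyError in the middle of a chat session.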

License

Specify your license here (e.g., MIT). Example:

MIT License

Contact

If you have questions, open an issue in the repo or contact the maintainer (LIKHITHADITHYA) via GitHub.
