🔬 RRI — Research & Repository Intelligence

A personalized research intelligence platform that helps researchers discover, organize, and interact with academic papers & repositories using AI-powered chat, semantic search, and automated trend analysis.

Quick Start • Features • Screenshots • CLI Tool • API Reference • Architecture • Contributing

📖 Overview

RRI (Research & Repository Intelligence) is a full-stack, self-hosted platform designed for researchers, engineers, and teams who want to automate the discovery and analysis of academic papers, open-source repositories, and AI/ML community discussions — all in one place.

RRI continuously collects data from 11+ sources, processes it with NLP pipelines, indexes everything into a vector database for semantic search, and provides an AI chat interface (RAG) so you can ask questions about your research corpus in natural language.

🎯 Who is RRI for?

Audience	Use Case
🎓 Researchers	Track new papers in your field, discover related work, get AI-generated summaries
👩‍💻 ML Engineers	Monitor trending GitHub repos, HuggingFace models, and community discussions
🏢 Research Teams	Centralized knowledge base with document chat, bookmarks, and weekly digests
📊 Tech Leads	Tech radar, trend analysis, and automated intelligence reports

⚡ Key Highlights

🔎 Semantic Search across 10,000+ papers & repos with vector similarity	🤖 Chat with your documents — upload PDFs, DOCX, or ingest GitHub repos
📡 Auto-collect from 11+ sources (ArXiv, GitHub, HuggingFace, OpenReview...)	💻 Built-in CLI — run OSINT tasks directly from the terminal
📊 Trending & Analytics — Tech Radar, trend charts, community insights	📚 Personal Library — bookmarks, folders, weekly research digests

✨ Features

📄 Multi-Source Data Collection

RRI automatically collects and aggregates data from 11+ academic and developer sources:

Source	Type	Data Collected
🔬 ArXiv	Papers	Pre-prints with abstracts, categories, authors
📚 Semantic Scholar	Papers	Citations, references, influence scores
🌐 OpenAlex	Papers	Open-access metadata, concepts, institutions
💻 Papers With Code	Papers + Code	Paper-code links, benchmarks, tasks
🐙 GitHub	Repositories	Stars, forks, languages, topics, README
🤗 HuggingFace	Models + Papers	Model cards, downloads, daily papers
📝 OpenReview	Peer Reviews	ICLR/NeurIPS reviews, ratings, decisions
🟠 Hacker News	Discussions	AI/ML posts, scores, comments
✍️ Dev.to	Blog Posts	Technical articles, tags, reactions
🐘 Mastodon	Social Posts	Research community discussions
🔗 Lemmy	Forum Posts	Federated community discussions

🔍 Semantic Search

Vector-based search powered by Qdrant and BGE embeddings
Search across papers and repositories simultaneously
Relevance scoring with percentage match display
Filter results by type (Papers / Repos / All)

🤖 AI-Powered Chat (RAG)

Retrieval-Augmented Generation pipeline with context-aware answers
Dual LLM support: Ollama (local, private) + OpenAI GPT-4o (cloud)
Document Chat: Upload PDFs, DOCX, PPTX → ask questions about your documents
Repo Ingestion: Ingest entire GitHub repositories via gitingest → chat about code
Full context mode vs. RAG retrieval mode per conversation
Conversation history with multi-turn support

📊 Analytics & Trending

Papers Overview: Category distribution (donut chart), yearly publication trends (bar chart)
Trending Papers & Repos: Filterable by period (day/week/month), category, language
Tech Radar: Auto-generated technology trend analysis
HuggingFace Dashboard: Model rankings, download stats, task distribution
Community Keywords: Trending topics across platforms with keyword analysis

📋 Knowledge Management

Bookmarks & Folders: Organize papers and repos into custom folders
My Library: Personal document collection with folder tree
Weekly Reports: Auto-generated research digest summaries
Paper-Code Linking: Automatically match papers to their implementations
Citation Enrichment: Bulk update citation counts from Semantic Scholar

🔐 Authentication & Multi-User

JWT-based authentication with user registration/login
Per-user document libraries, bookmarks, and conversations
Role-based access to AI chat features

🌙 Modern UI/UX

Dark/Light theme toggle with smooth transitions
Responsive design with glassmorphism effects and micro-animations
Interactive charts built with Recharts
Knowledge graph visualization with react-force-graph-2d
Global search bar with keyboard shortcut (/)

💻 CLI Tool

RRI includes a built-in CLI for running OSINT tasks from the terminal. See the full CLI documentation.

rri collect arxiv --query "LLM" --category cs.AI --max-results 100
rri search vector "multi-modal RAG" --limit 10
rri analyze paper 2401.12345 --cloud
rri export report --period weekly --format md
rri chat   # Interactive RAG chat

📸 Screenshots

Landing Page

Dashboard

Papers — Overview & Analytics

Papers — Browse & Filter

Paper Detail

Semantic Search

AI Chat (RAG)

Community & OpenReview

HuggingFace Models

Repositories

OpenReview

My Library

Weekly Reports

CLI — `rri search`

CLI — `rri chat` (Interactive RAG)

🛠 Tech Stack

Backend

Technology	Purpose
FastAPI	Async REST API framework
SQLAlchemy 2.0	Async ORM with PostgreSQL
Celery	Distributed task queue
Qdrant	Vector similarity search engine
Sentence Transformers	BGE text embeddings
Ollama	Local LLM inference (Llama 3)
OpenAI API	Cloud LLM (GPT-4o)
Alembic	Database migrations
Pydantic v2	Data validation & settings

Frontend

Technology	Purpose
Next.js 14	React framework (App Router)
TypeScript	Type-safe JavaScript
TailwindCSS	Utility-first styling
Recharts	Data visualization charts
react-force-graph-2d	Knowledge graph visualization
Lucide React	Icon library
Axios	HTTP client

Infrastructure

Technology	Purpose
Docker Compose	Multi-container orchestration
PostgreSQL 16	Relational database
Redis 7	Caching & Celery message broker
Qdrant	Vector embeddings storage
Ollama	Self-hosted LLM runtime

🚀 Quick Start

Prerequisites

Docker & Docker Compose (v2.0+)
Git
(Optional) GitHub Personal Access Token for higher API rate limits
(Optional) OpenAI API key for cloud LLM features

1. Clone & Configure

git clone https://github.com/nhdandz/ResearchRover.git
cd ResearchRover
cp .env.example .env
# Edit .env with your API keys

2. Start All Services

make up

This launches 8 containers: app, worker, beat, postgres, redis, qdrant, ollama, frontend.

3. Initialize

make migrate          # Run database migrations
make pull-model       # Download Ollama LLM model
make seed             # (Optional) Seed demo data

4. Access

Service	URL
🌐 Frontend	http://localhost:3000
⚡ Backend API	http://localhost:8000
📖 API Docs	http://localhost:8000/docs
🔍 Qdrant	http://localhost:6333/dashboard

📚 For detailed setup, environment variables, and local development: see Configuration Guide

🗺 Roadmap

🔔 Real-time alerting with email/Slack notifications
📈 Advanced trend analysis with time-series visualization
🌍 Multi-language support (Vietnamese paper sources already integrated)
📱 Mobile-responsive PWA
🔗 BibTeX export and Zotero integration
🧩 Plugin system for custom data sources
📊 Comparative analysis dashboards
🤝 Team collaboration features

📚 Documentation

Document	Description
CLI Tool	Full CLI reference with all commands and options
API Reference	REST API endpoints, authentication, examples
Architecture	System design, data pipeline, project structure
Configuration	Environment variables, local dev setup, testing
Deployment	Docker deployment, Cloudflare Tunnel, VPS guide
Contributing	How to contribute, code style, PR process

📄 License

This project is licensed under the MIT License — see the LICENSE file for details.

Built with ❤️ for the research community
_{If you find RRI useful, consider giving it a ⭐ on GitHub!}

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
docs		docs
frontend		frontend
migrations		migrations
reports		reports
scripts		scripts
src		src
tests		tests
.env.example		.env.example
.gitignore		.gitignore
CONTRIBUTING.md		CONTRIBUTING.md
Dockerfile		Dockerfile
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
alembic.ini		alembic.ini
docker-compose.yml		docker-compose.yml
pyproject.toml		pyproject.toml

Folders and files

Latest commit

History

Repository files navigation

🔬 RRI — Research & Repository Intelligence

📖 Overview

🎯 Who is RRI for?

⚡ Key Highlights

✨ Features

📄 Multi-Source Data Collection

🔍 Semantic Search

🤖 AI-Powered Chat (RAG)

📊 Analytics & Trending

📋 Knowledge Management

🔐 Authentication & Multi-User

🌙 Modern UI/UX

💻 CLI Tool

📸 Screenshots

Landing Page

Dashboard

Papers — Overview & Analytics

Papers — Browse & Filter

Paper Detail

Semantic Search

AI Chat (RAG)

Trending

Community & OpenReview

HuggingFace Models

Repositories

OpenReview

My Library

Weekly Reports

CLI — rri search

CLI — rri chat (Interactive RAG)

🛠 Tech Stack

Backend

Frontend

Infrastructure

🚀 Quick Start

Prerequisites

1. Clone & Configure

2. Start All Services

3. Initialize

4. Access

🗺 Roadmap

📚 Documentation

📄 License

About

Topics

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

CLI — `rri search`

CLI — `rri chat` (Interactive RAG)

Packages