feat: Add PDF file upload support for AI-powered diagram generation #148

Biki-dev · 2025-12-07T05:38:27Z

📄 Overview

Implements PDF file upload functionality to enable users to upload academic papers, documentation, and other PDF files for AI-powered diagram generation.

🎯 Features

Phase 1: Universal Text Extraction

✅ PDF text extraction using pdf-parse library
✅ Works with ALL AI providers
✅ Preserves metadata (title, author, page count)
✅ Automatic text cleaning and normalization

Phase 2: Native File Upload

✅ Direct PDF upload for supported providers (Anthropic, OpenAI, Google, Vertex AI)
✅ Leverages Vercel AI SDK native file support
✅ Automatic provider capability detection
✅ Seamless fallback to text extraction for unsupported providers

Additional Enhancements

✅ Configurable via ENABLE_PDF_INPUT environment variable
✅ File size validation (default 5MB, configurable via MAX_FILE_SIZE)
✅ Support for multiple files (up to 5: PDFs + images)
✅ Enhanced UI with PDF preview placeholders
✅ File size indicators on hover
✅ Comprehensive error handling with user-friendly messages
✅ Configuration API endpoint at /api/config

🖼️ Screenshots

PDF Upload Interface:

Shows PDF file (2639KB) attached with remove button
File size displayed: "2639KB"
Icon changes to FileText when PDF support enabled

File Size Validation:

Proper error message: "File size exceeds 5MB limit"
User-friendly feedback
No crashes or silent failures

🔧 Technical Implementation

Provider Compatibility Matrix

Provider	Method	Phase
Anthropic	Native Upload	Phase 2
OpenAI	Native Upload	Phase 2
Google AI	Native Upload	Phase 2
Vertex AI	Native Upload	Phase 2
AWS Bedrock	Text Extraction	Phase 1
Azure OpenAI	Text Extraction	Phase 1
Ollama	Text Extraction	Phase 1
DeepSeek	Text Extraction	Phase 1
OpenRouter	Text Extraction	Phase 1

Architecture

User uploads PDF → Validate file → Check provider capability
                                    ↓
                    ┌───────────────┴───────────────┐
                    ↓                               ↓
            Native Support                  Text Extraction
         (Anthropic, OpenAI, etc.)         (Bedrock, Ollama, etc.)
                    ↓                               ↓
            Pass to AI as binary              Extract & pass as text
                    └───────────────┬───────────────┘
                                    ↓
                        AI processes content
                                    ↓
                        Generate diagram XML

📦 Files Changed

.env.example: Added ENABLE_PDF_INPUT configuration flag
app/api/chat/route.ts: PDF processing logic with provider detection
app/api/config/route.ts: New endpoint for feature configuration
components/chat-input.tsx: Enhanced UI with PDF support
lib/pdf-utils.ts: PDF utility functions (extraction, validation)
package.json: Added pdf-parse@^1.1.1 dependency

🧪 Testing

📝 Usage Example

# Enable PDF upload
echo "ENABLE_PDF_INPUT=true" >> .env.local

# Optional: Increase file size limit to 10MB
echo "MAX_FILE_SIZE=10485760" >> .env.local

# Restart server
npm run dev

🐛 Resolves

Closes #141

🚀 Deployment Notes

Required environment variables:

ENABLE_PDF_INPUT=true  # Required to enable the feature
MAX_FILE_SIZE=5242880  # Optional, defaults to 5MB

Dependencies to install:

npm install pdf-parse@^1.1.1

Ready for review! 🎉

vercel · 2025-12-07T05:38:30Z

@Biki-dev is attempting to deploy a commit to the dayuanjiang's projects Team on Vercel.

A member of the Team first needs to authorize it.

DayuanJiang · 2025-12-07T06:07:05Z

Thanks for you contribution! I will check it soon.

DayuanJiang · 2025-12-07T09:18:34Z

I have checked tht code.

The new attachments state in chat-input.tsx isn't connected to form submission. When user selects a file, it goes into local attachments state and shows the preview, but onFileChange() is never called - so the parent's files state stays empty. When the form submits, chat-panel.tsx reads from its own files state (line 463) which is always [].

Also noticed:

File size limit mismatch: client uses 5MB, server uses 2MB (route.ts line 22)
Using alert() instead of the existing toast system

Biki-dev · 2025-12-07T09:43:58Z

ok i try to resolve it ... thanks for review

feat: Add PDF file upload support for diagram generation

310a75e

fix: Add type definitions for pdf-parse

5ac0374

Biki-dev closed this Dec 8, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

feat: Add PDF file upload support for AI-powered diagram generation #148

feat: Add PDF file upload support for AI-powered diagram generation #148

Biki-dev commented Dec 7, 2025 •

edited

Loading

Uh oh!

vercel bot commented Dec 7, 2025

Uh oh!

DayuanJiang commented Dec 7, 2025

Uh oh!

DayuanJiang commented Dec 7, 2025

Uh oh!

Biki-dev commented Dec 7, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

feat: Add PDF file upload support for AI-powered diagram generation #148

feat: Add PDF file upload support for AI-powered diagram generation #148

Conversation

Biki-dev commented Dec 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

📄 Overview

🎯 Features

Phase 1: Universal Text Extraction

Phase 2: Native File Upload

Additional Enhancements

🖼️ Screenshots

🔧 Technical Implementation

Provider Compatibility Matrix

Architecture

📦 Files Changed

🧪 Testing

📝 Usage Example

🐛 Resolves

🚀 Deployment Notes

Uh oh!

vercel bot commented Dec 7, 2025

Uh oh!

DayuanJiang commented Dec 7, 2025

Uh oh!

DayuanJiang commented Dec 7, 2025

Uh oh!

Biki-dev commented Dec 7, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Biki-dev commented Dec 7, 2025 •

edited

Loading