Add AI Assistant Feature with OpenAI Integration #4873

MarsPresLai · 2025-12-11T07:49:04Z

Summary

This PR adds a comprehensive AI-powered assistant to the Open OnDemand dashboard that helps users manage HPC tasks through natural language conversations. The assistant appears as a floating chat bubble in the bottom-right corner of all dashboard pages.

Motivation

New HPC users often face a steep learning curve when learning to manage jobs, navigate file systems, and understand cluster resources. This AI assistant provides:

Natural language interface that reduces friction for new users
Instant help without context-switching to documentation
Faster workflows for power users performing common tasks
Improved overall user experience and accessibility

Features

Core Capabilities

Job Management: List, view details, and delete jobs across clusters
File Operations: Browse directories, read files, create files, and submit batch jobs
Cluster Information: View cluster status and job queue statistics
Interactive Sessions: Monitor active sessions (Jupyter, Desktop, etc.)
Batch Job Submission: Create and submit job scripts through natural language

User Interface

Floating purple chat bubble (always accessible)
Clean, modern chat interface (380px × 520px)
Conversation history with context awareness
Markdown formatting support (code blocks, bold, italic, links)
Responsive design (mobile-friendly)
Dark mode support
Loading indicators and error handling

Technical Implementation

Backend: Rails controller with OpenAI function calling (12 tools)
Frontend: Vanilla JavaScript widget (no dependencies)
Integration: Inline partial to bypass asset pipeline complexity
Security: CSRF protection, user permission checks, path validation

Architecture

┌─────────────────────────────────────────┐
│           Frontend (Browser)             │
│  • Floating chat bubble                  │
│  • Inline HTML/CSS/JavaScript            │
│  • CSRF token handling                   │
└─────────────────┬───────────────────────┘
                  │ POST /pun/sys/dashboard/assistant/chat
                  ▼
┌─────────────────────────────────────────┐
│      AssistantController (Rails)         │
│  • OpenAI API integration                │
│  • 12 tools (9 read + 3 write)          │
│  • Conversation history management       │
└─────────────────┬───────────────────────┘
                  │
        ┌─────────┴─────────┐
        ▼                   ▼
┌──────────────┐   ┌──────────────────┐
│  OpenAI API  │   │  OOD Core Libs   │
│  • GPT-4o    │   │  • Job adapters  │
│  • Tools     │   │  • File ops      │
└──────────────┘   └──────────────────┘

Configuration

Requirements

OpenAI API key (required)
Rails 7.1.6+ (already in OOD)
No additional gems or JavaScript dependencies

Setup

# Set API key in environment
echo "OPENAI_API_KEY=your-api-key-here" >> /etc/ood/config/apps/dashboard/env

# Restart web server
sudo systemctl restart httpd

Optional Configuration

# Change OpenAI model (default: gpt-4o-mini)
OPENAI_MODEL=gpt-4o

Tools Available

The assistant has access to 12 tools for HPC operations:

Read Operations:

list_jobs - List all jobs across clusters
get_job_details - Get detailed job information
delete_job - Delete/cancel jobs (with user confirmation)
list_files - Browse directories
read_file - Read text file contents (first 100 lines)
get_file_info - Get file metadata
get_cluster_status - View cluster information
list_interactive_sessions - List active sessions
get_available_apps - List launchable apps

Write Operations:
10. create_file - Create files with content
11. submit_batch_job - Submit existing job scripts
12. create_and_submit_job - Create script + submit in one step

Usage Examples

Example 1: Check Running Jobs

User: Show me my running jobs
Assistant: You have 3 running jobs:

| Job ID | Name | Cluster | Queue | Nodes |
|--------|------|---------|-------|-------|
| 12345 | simulation | pitzer | batch | 4 |

Example 2: Create and Submit Job

User: Create a simple hello world job script and submit it
Assistant: I'll create a job script for you and submit it.

Job script created at /home/user/hello_job.sh and 
submitted as job 12348 to pitzer cluster.

Example 3: Browse Files

User: What's in my home directory?
Assistant: Here are the contents of /home/user:

projects/ (directory)
data/ (directory)
config.yaml (1.2 KB)

Files Changed

New Files

apps/dashboard/app/controllers/assistant_controller.rb (465 lines) - Backend LLM agent with OpenAI integration
apps/dashboard/app/views/layouts/_assistant.html.erb (320 lines) - Inline widget with HTML/CSS/JS
apps/dashboard/app/javascript/assistant.js (295 lines) - Modular JavaScript widget (for asset pipeline builds)
apps/dashboard/app/assets/stylesheets/assistant.css (360 lines) - Widget styles with dark mode support
DOCKER_SETUP.md (265 lines) - Docker development environment guide
apps/dashboard/AI_ASSISTANT.md (370 lines) - Complete feature documentation
COMMIT_SUMMARY.md (437 lines) - Detailed implementation notes

Modified Files

apps/dashboard/config/routes.rb - Added assistant routes (POST /assistant/chat, GET /assistant/status)
apps/dashboard/app/assets/stylesheets/application.scss - Import assistant styles
apps/dashboard/app/javascript/application.js - Import assistant module
.gitignore - Exclude cookies and session files

Security Considerations

Authentication: Runs in user's authenticated session (no separate auth)
Authorization: Uses user's existing permissions (no privilege escalation)
Path Validation: File operations restricted to user's home directory
CSRF Protection: Rails CSRF token validation on all requests
No Persistence: Conversation history is client-side only (privacy)
Input Validation: All tool arguments validated before execution
Read-Only by Default: Most tools are read-only; write operations clearly separated

Testing

All testing was performed manually in a Docker development environment.

Test Coverage

✅ Purple bubble appears on all dashboard pages
✅ Chat window opens/closes correctly
✅ Messages display in correct order with proper formatting
✅ All 12 tools execute successfully
✅ Error handling displays user-friendly messages
✅ Responsive design works on mobile viewports
✅ CSRF token validation prevents unauthorized requests
✅ Markdown formatting (code blocks, lists, links) renders correctly
✅ Graceful degradation when API key not configured

Test Scenarios Verified

Job Management: List jobs, view details, delete jobs across multiple clusters
File Operations: Browse directories, read files, create new files
Batch Jobs: Create and submit job scripts via natural language
Error Handling: Invalid paths, missing permissions, API failures
Security: Path traversal attempts blocked, user permissions enforced
UI/UX: Loading states, conversation history, markdown rendering

Manual Test Commands

# 1. Test status endpoint
curl http://localhost/pun/sys/dashboard/assistant/status

# 2. Test chat endpoint
curl -u username:password http://localhost/pun/sys/dashboard/assistant/chat \
  -H "Content-Type: application/json" \
  -d '{"message": "What jobs are running?"}'

# 3. Browser testing
# - Login to dashboard
# - Click purple bubble in bottom-right corner
# - Send messages: "Show me my jobs", "List files in my home directory"
# - Verify responses are accurate and well-formatted

Documentation

apps/dashboard/AI_ASSISTANT.md: Complete feature documentation with usage examples, customization guide, troubleshooting, and API reference
DOCKER_SETUP.md: Docker development environment setup including AI assistant configuration
COMMIT_SUMMARY.md: Detailed implementation notes and architectural decisions

Known Limitations

No Conversation Persistence: History is client-side only (refreshing page loses context)
Rate Limiting: No built-in rate limiting (consider adding for production use)
Cost: OpenAI API calls have usage costs (recommend monitoring)
File Size: File reading limited to 100KB (prevents memory issues)
No Streaming: Responses sent in full (not streamed token-by-token)

Future Enhancements

Potential improvements for future PRs:

Conversation persistence in database/session
Streaming responses for faster perceived performance
Per-user rate limiting
Pluggable tool system for custom site-specific tools
Support for alternative LLM providers (Anthropic, local models)
Pre-built job template library
Batch operations (e.g., cancel multiple jobs at once)
Proactive notifications for job status changes

Breaking Changes

None. This is a new optional feature that:

Does not modify existing functionality
Requires explicit opt-in via API key configuration
Has no runtime dependencies beyond OpenAI (when enabled)
Can be completely disabled by not setting the API key
Gracefully degrades when disabled (no errors shown to users)

Deployment Notes

No Database Migrations: Feature uses existing OOD infrastructure
No Gem Changes: Uses only standard Ruby/Rails libraries (net/http, json)
Asset Pipeline: Uses inline partial approach to avoid precompilation complexity
Backwards Compatible: Works with existing OOD installations
Zero Downtime: Can be deployed without service interruption (restart recommended to load new code)
No External Services Required: OpenAI API only used when assistant is invoked

Code Style Compliance

This PR follows the OOD Contributing Guidelines:

Ruby Style

✅ Snake_case for methods and variables
✅ 2-space indentation, no tabs
✅ Attributes defined with attr_reader (read-only objects)
✅ Comments explaining intent at class/method level
✅ Meaningful variable names (no single letters except block iterators)
✅ Methods use ? suffix for boolean returns
✅ Implicit begin blocks in rescue statements
✅ Files end with newline

JavaScript Style

✅ File names use underscores (assistant.js)
✅ camelCase for variables and functions
✅ const and let (no var)
✅ ES6 class syntax for modularity

CSS Style

✅ Class names use hyphens (ood-assistant-bubble)
✅ Relative sizes (em, rem) over pixels where appropriate
✅ Responsive breakpoints for mobile

HTML Style

✅ IDs use underscores (ood-assistant-input)
✅ Semantic HTML5 elements

Checklist

Additional Notes

Why Inline Partial Instead of Asset Pipeline?

This implementation uses an inline partial (_assistant.html.erb) rather than fully compiled assets because:

Production Complexity: Precompiled assets with fingerprints make live updates difficult in production
Container Deployment: Easier to update without full asset recompilation
Self-Contained: Single file contains all HTML/CSS/JS for easier maintenance
Faster Development: Simpler to modify and test without build steps

The trade-off is a slightly larger HTML payload (~10KB), but the benefit is significantly easier deployment and maintenance. The separate .js and .css files are included for sites that prefer to use the asset pipeline.

Features: - Floating chat bubble widget on all dashboard pages - LLM-powered assistant with 12 HPC management tools - Job management (list, view details, delete) - File operations (browse, read, create, submit jobs) - Cluster status and interactive session monitoring - Inline implementation to bypass asset pipeline complexity Components: - AssistantController: Backend with OpenAI function calling - JavaScript widget: Vanilla JS chat interface - CSS styling: Purple gradient theme with dark mode support - Routes: POST /assistant/chat, GET /assistant/status Configuration: - Requires OPENAI_API_KEY environment variable - Supports custom model selection via OPENAI_MODEL - Tool-based architecture for extensibility Documentation: - DOCKER_SETUP.md: Container development environment guide - AI_ASSISTANT.md: Feature documentation and customization guide

…kens

johrstrom · 2025-12-11T14:27:09Z

Hi, thank you for your contribution and interest in this project! We're currently trying to finish 4.1 development, so it may take a bit for this to get looked at.

One concern I've had with this idea is the OPENAI_API_KEY being readable and exposed to the user through the env file. I see this adds a lot of complexity to solve that by sending requests to rails then having rails send requests to the model.

It'll take me quite some time to look through the mutative actions that this controller can do.

Beyond that, I don't think we need all the .md files.

✅ IDs use underscores (ood-assistant-input)

This does seem wrong though, those are hypens (-) not underscores (_).

- Remove unnecessary documentation files - Add comprehensive security documentation to controller - Document API key protection (server-side only, never exposed to browser) - Document user isolation (operations use CurrentUser permissions) - Add comments explaining mutative actions and security model

MarsPresLai · 2025-12-12T04:02:33Z

Hello, thank you for your quick response! I updated the code based on the points above. If there is any other concern or modification needed, feel free to let me know.

Bubballoo3 · 2025-12-12T19:26:25Z

Hey, we are putting this on hold for now while we finish up the next release, and will revisit it after the new year. The deadline for external contributions has passed for 4.1, so we will get back to you with reviews once we wrap this up.

ktomko · 2025-12-17T16:43:31Z

There is potential to add a voice interface to this which could provide a lot of capability for OOD access without the need for a keyboard for accessible or mobile use cases.

MarsPresLai · 2025-12-18T08:02:00Z

@ktomko The feature sounds very useful. Would you be interested in working on this voice feature in this pr?

ktomko · 2025-12-18T18:39:53Z

@MarsPresLai To answer your question, no. I see an audio interface as a possible next step and think it makes sense to consider for a subsequent development effort.

MarsPresLai · 2025-12-19T02:38:44Z

@ktomko I see. Thanks for your great suggestion.

johrstrom · 2025-12-19T14:34:08Z

Thinking about this a bit more, my thinking is that interpreting an AI response and acting on it (mutating the system) is better suited to an AI agent. I'm weary of having lots of if/else blocks with regular expressions to do what effectively AI agents are built for.

MarsPresLai added 2 commits December 11, 2025 15:35

Add cookies.txt to gitignore to prevent accidental commits of auth to…

3683635

…kens

github-project-automation bot added this to PR Review Pipeline Dec 11, 2025

github-project-automation bot moved this to Awaiting Review in PR Review Pipeline Dec 11, 2025

osc-bot added the component/dashboard label Dec 11, 2025

add docs

5116862

Bubballoo3 added the status/on hold label Dec 12, 2025

moffatsadeghi moved this from Awaiting Review to On Hold in PR Review Pipeline Dec 15, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add AI Assistant Feature with OpenAI Integration #4873

Add AI Assistant Feature with OpenAI Integration #4873

Uh oh!

MarsPresLai commented Dec 11, 2025

Uh oh!

johrstrom commented Dec 11, 2025

Uh oh!

MarsPresLai commented Dec 12, 2025

Uh oh!

Bubballoo3 commented Dec 12, 2025

Uh oh!

ktomko commented Dec 17, 2025

Uh oh!

MarsPresLai commented Dec 18, 2025

Uh oh!

ktomko commented Dec 18, 2025

Uh oh!

MarsPresLai commented Dec 19, 2025

Uh oh!

johrstrom commented Dec 19, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Add AI Assistant Feature with OpenAI Integration #4873

Are you sure you want to change the base?

Add AI Assistant Feature with OpenAI Integration #4873

Uh oh!

Conversation

MarsPresLai commented Dec 11, 2025

Summary

Motivation

Features

Core Capabilities

User Interface

Technical Implementation

Architecture

Configuration

Requirements

Setup

Optional Configuration

Tools Available

Usage Examples

Example 1: Check Running Jobs

Example 2: Create and Submit Job

Example 3: Browse Files

Files Changed

New Files

Modified Files

Security Considerations

Testing

Test Coverage

Test Scenarios Verified

Manual Test Commands

Documentation

Known Limitations

Future Enhancements

Breaking Changes

Deployment Notes

Code Style Compliance

Ruby Style

JavaScript Style

CSS Style

HTML Style

Checklist

Additional Notes

Why Inline Partial Instead of Asset Pipeline?

Uh oh!

johrstrom commented Dec 11, 2025

Uh oh!

MarsPresLai commented Dec 12, 2025

Uh oh!

Bubballoo3 commented Dec 12, 2025

Uh oh!

ktomko commented Dec 17, 2025

Uh oh!

MarsPresLai commented Dec 18, 2025

Uh oh!

ktomko commented Dec 18, 2025

Uh oh!

MarsPresLai commented Dec 19, 2025

Uh oh!

johrstrom commented Dec 19, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants