🎨 Multi-Model AI Content Generator

A local AI application that bundles several image and video generation models behind a web interface. It runs on NVIDIA GPUs (CUDA) and AMD GPUs (DirectML), with a CPU fallback.

🚀 Quick Start

Install Python 3.8+ from python.org (check "Add to PATH")

Clone this repository

git clone https://github.com/TriggeredBanana/Multi-Model-AI-Generator.git
cd Multi-Model-AI-Generator

Run the application
```
start.bat
```
Open your browser at http://localhost:7860

The launcher automatically installs dependencies and configures your GPU!

📊 Available Models

| Model | Type | Download | Disk | Speed | Best For | Auth Required |
|---|---|---|---|---|---|---|
| Flux Schnell | Image | ~50-70GB | 23GB | Fast | Photorealistic images | ✅ |
| SDXL | Image | ~140-150GB | 72GB | Very Fast | Artistic styles | ❌ |
| SD3 Medium | Image | ~15-20GB | 10GB | Fast | Text/logos | ✅ |
| Stable Video | Video | ~15-20GB | 10GB | Slow | Image→Video | ❌ |
| AnimateDiff | Video | ~25-30GB | 15GB | Slow | Text→Video | ❌ |

Where to Find/Download Models

Models auto-download from Hugging Face when first used. No manual download needed!

Hugging Face Model Links:

Flux Schnell: black-forest-labs/FLUX.1-schnell (requires HF account & accepting license)
SDXL: stabilityai/stable-diffusion-xl-base-1.0 (no auth required)
SD3 Medium: stabilityai/stable-diffusion-3-medium-diffusers (requires HF account & accepting license)
Stable Video Diffusion: stabilityai/stable-video-diffusion-img2vid-xt (no auth required)
AnimateDiff: guoyww/animatediff-motion-adapter-v1-5-2 (no auth required)

Cache Location: Models save to ~/.cache/multi_model_ai/ by default (configurable in app's Advanced Management tab)
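
The model links and cache location above can be sketched as a small registry plus a download helper. This is a minimal sketch, not the app's actual code: `MODEL_REPOS`, `resolve_cache_dir`, and `download_model` are hypothetical names, and the `MULTI_MODEL_AI_CACHE` environment variable is an assumption made for illustration (the app itself exposes the cache path in its Advanced Management tab). `snapshot_download` is the Hugging Face Hub function that fetches a whole repository.

```python
import os
from pathlib import Path
from typing import Optional

# Repo IDs and gating status taken from the list above.
MODEL_REPOS = {
    "Flux Schnell": ("black-forest-labs/FLUX.1-schnell", True),
    "SDXL": ("stabilityai/stable-diffusion-xl-base-1.0", False),
    "SD3 Medium": ("stabilityai/stable-diffusion-3-medium-diffusers", True),
    "Stable Video Diffusion": ("stabilityai/stable-video-diffusion-img2vid-xt", False),
    "AnimateDiff": ("guoyww/animatediff-motion-adapter-v1-5-2", False),
}


def resolve_cache_dir(override: Optional[str] = None) -> Path:
    """Return the cache directory, defaulting to ~/.cache/multi_model_ai/."""
    # MULTI_MODEL_AI_CACHE is a hypothetical variable used only in this sketch.
    chosen = override or os.environ.get("MULTI_MODEL_AI_CACHE")
    if chosen:
        return Path(chosen).expanduser()
    return Path.home() / ".cache" / "multi_model_ai"


def download_model(name: str, token: Optional[str] = None) -> str:
    """Download one model repository into the cache and return its local path."""
    repo_id, gated = MODEL_REPOS[name]
    if gated and not token:
        raise ValueError(f"{name} is gated: accept its license and pass a token")
    from huggingface_hub import snapshot_download  # imported lazily

    return snapshot_download(
        repo_id=repo_id, cache_dir=str(resolve_cache_dir()), token=token
    )
```

Calling `download_model("SDXL")` needs no token, while the two gated models raise early instead of failing partway through a download.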

🎮 GPU Support

NVIDIA (CUDA):

Auto-detected for all CUDA-capable GPUs
Fastest performance (seconds to minutes)
GTX 1060 6GB+ or RTX series recommended
All models fully accelerated

AMD (DirectML):

RX 5000+ series with 8GB+ VRAM supported
Auto-configured on first run
2-5x faster than CPU mode
Run python setup_amd_gpu.py if issues occur

CPU Mode:

Works on any system as fallback
Auto-optimized for performance
Slower but reliable (15-60 minutes per image)
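
The detection order described above (CUDA first, then DirectML, then CPU) could look roughly like the sketch below. It is an illustration under assumptions, not the launcher's real logic: `detect_backend` is a hypothetical name, and the DirectML check assumes the `onnxruntime-directml` package listed under Dependencies is what provides AMD acceleration.

```python
def detect_backend() -> str:
    """Return "cuda", "directml", or "cpu" depending on what is usable."""
    try:
        import torch

        if torch.cuda.is_available():
            return "cuda"
    except ImportError:
        pass  # PyTorch is not installed; keep probing

    try:
        import onnxruntime as ort

        # onnxruntime-directml registers this execution provider on Windows.
        if "DmlExecutionProvider" in ort.get_available_providers():
            return "directml"
    except ImportError:
        pass  # ONNX Runtime is not installed

    return "cpu"  # always available as the fallback


if __name__ == "__main__":
    print(f"Selected backend: {detect_backend()}")
```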

🛠️ Setup Instructions

First Time Setup

Start the app: Double-click start.bat or run in terminal
GPU detection: Automatic - check console for GPU status
For gated models (Flux Schnell, SD3 Medium):
- Create account at Hugging Face
- Get access token: Settings → Access Tokens
- Visit model pages (links above) and click "Agree and access repository"
- Enter token in app's "🔐 Authentication" tab
Download models: Go to "📥 Model Management" tab, select models to download
Start creating: Select model in generation tab, load it, and generate!

Using the Application

Image Generation:

Navigate to "🎨 Image Generation" tab
Choose model based on your needs:
- SDXL - Fast, artistic, no auth (great first choice!)
- Flux Schnell - Photorealistic, detailed scenes
- SD3 Medium - Best for text/logos in images
Click "Load Model" (takes 30-60 seconds from cache)
Enter your prompt and adjust settings
Click "Generate Image"
Images automatically save to generated_content/ folder
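
For readers who want to see what the steps above amount to outside the web interface, here is a minimal sketch using the Diffusers SDXL pipeline. It is not the app's code: `output_path` and `generate_image` are hypothetical helpers, the file-naming scheme is an assumption, and the default settings are only examples.

```python
import re
from pathlib import Path


def output_path(prompt: str, seed: int, out_dir: str = "generated_content") -> Path:
    """Build a filesystem-safe file name from the prompt and seed."""
    slug = re.sub(r"[^a-z0-9]+", "_", prompt.lower()).strip("_")[:40] or "image"
    return Path(out_dir) / f"{slug}_{seed}.png"


def generate_image(prompt: str, seed: int = 0, steps: int = 30,
                   guidance: float = 7.5, size: int = 1024) -> Path:
    """Load SDXL, generate one image, and save it under generated_content/."""
    import torch  # heavy imports are kept inside the function
    from diffusers import StableDiffusionXLPipeline

    device = "cuda" if torch.cuda.is_available() else "cpu"
    pipe = StableDiffusionXLPipeline.from_pretrained(
        "stabilityai/stable-diffusion-xl-base-1.0",
        torch_dtype=torch.float16 if device == "cuda" else torch.float32,
    ).to(device)

    # A fixed seed makes the result reproducible.
    generator = torch.Generator(device).manual_seed(seed)
    image = pipe(
        prompt,
        num_inference_steps=steps,
        guidance_scale=guidance,
        height=size,
        width=size,
        generator=generator,
    ).images[0]

    path = output_path(prompt, seed)
    path.parent.mkdir(parents=True, exist_ok=True)
    image.save(path)
    return path
```

Loading the pipeline is the slow part, which is why the app separates "Load Model" from "Generate Image"; a longer-lived script would load once and reuse `pipe`.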

Video Generation:

Navigate to "🎬 Video Generation" tab
Choose model:
- Stable Video Diffusion - Animate existing images (upload an image)
- AnimateDiff - Generate videos from text prompts
Click "Load Model" and wait for initialization
Configure your input (image upload or text prompt)
Click "Generate Video"
Videos automatically save to generated_content/ folder
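
The image-to-video path can be sketched with the Diffusers Stable Video Diffusion pipeline. This is a hedged illustration rather than the app's implementation: `animate_image` and `clip_seconds` are hypothetical names, and the 1024x576 input size, the frame rate, and the output file name are assumptions chosen for the example.

```python
def clip_seconds(num_frames: int, fps: int) -> float:
    """Length of the exported clip in seconds."""
    return num_frames / fps


def animate_image(image_path: str, out_path: str = "generated_content/clip.mp4",
                  fps: int = 7, seed: int = 0) -> str:
    """Animate a still image with Stable Video Diffusion (needs a CUDA GPU)."""
    import os

    import torch
    from diffusers import StableVideoDiffusionPipeline
    from diffusers.utils import export_to_video, load_image

    pipe = StableVideoDiffusionPipeline.from_pretrained(
        "stabilityai/stable-video-diffusion-img2vid-xt",
        torch_dtype=torch.float16,
        variant="fp16",
    ).to("cuda")

    image = load_image(image_path).resize((1024, 576))
    # decode_chunk_size trades speed for lower peak memory while decoding frames.
    frames = pipe(
        image, decode_chunk_size=8, generator=torch.manual_seed(seed)
    ).frames[0]

    os.makedirs(os.path.dirname(out_path) or ".", exist_ok=True)
    return export_to_video(frames, out_path, fps=fps)
```

Clip length is simply frame count divided by frame rate, so short clips follow directly from the small number of frames the model produces per run.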

💻 System Requirements

Minimum:

Python 3.8 or higher
16GB RAM
50GB free disk space
Internet connection for model downloads
8GB VRAM (recommended) or CPU mode

Recommended:

Python 3.10+
32GB RAM
100GB SSD storage
12GB+ VRAM (RTX 3080/4070 or RX 6800XT/6950XT)
High-speed internet for faster downloads

Dependencies (auto-installed by start.bat):

torch, diffusers, transformers, accelerate
gradio, Pillow, numpy, huggingface-hub
opencv-python, imageio, imageio-ffmpeg
For AMD: onnxruntime-directml, optimum[onnxruntime]
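
A quick way to check a machine against the minimum requirements before launching is sketched below. It is an assumption-laden example, not part of the repository: `preflight` and `REQUIRED_MODULES` are hypothetical names, the 50 GB threshold mirrors the minimum listed above, and RAM and VRAM are not checked because the standard library cannot report them.

```python
import importlib.util
import shutil
import sys

# Import names for the packages listed above (for example, Pillow imports as PIL).
REQUIRED_MODULES = [
    "torch", "diffusers", "transformers", "accelerate", "gradio", "PIL",
    "numpy", "huggingface_hub", "cv2", "imageio", "imageio_ffmpeg",
]


def preflight(min_python=(3, 8), min_free_gb=50, path="."):
    """Return a dict describing which minimum requirements are met."""
    free_gb = shutil.disk_usage(path).free / 1024**3
    missing = [m for m in REQUIRED_MODULES if importlib.util.find_spec(m) is None]
    return {
        "python_ok": sys.version_info >= min_python,
        "disk_ok": free_gb >= min_free_gb,
        "free_gb": round(free_gb, 1),
        "missing_modules": missing,
    }


if __name__ == "__main__":
    for key, value in preflight().items():
        print(f"{key}: {value}")
```

A non-empty `missing_modules` list is expected on a fresh machine, since start.bat installs the dependencies on first run.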

🆘 Troubleshooting

Common Issues & Solutions

Download/Network Issues:

Run network_setup_helper.bat for comprehensive diagnostics
Use "🌐 Network Diagnostics" tab in the app
Try SDXL first (no authentication required)
Run start.bat as administrator if permission errors occur
Check firewall settings for Python

AMD GPU Not Detected:

Run python setup_amd_gpu.py to repeat GPU detection and DirectML configuration

Update AMD drivers from AMD.com
Verify GPU in Device Manager (Display adapters)
Ensure 8GB+ VRAM available
Check for DirectML support in app console

Out of Memory Errors:

Try smaller model (SDXL uses less memory than Flux)
Lower image resolution (512x512 instead of 1024x1024)
Reduce number of inference steps
Close other GPU-intensive applications
Restart app to clear GPU memory
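
The advice above (lower the resolution, reduce the steps) can be automated as a fallback ladder. The sketch below is illustrative only: `generate_with_fallback` and `FALLBACK_LADDER` are hypothetical, the ladder values are examples, and `generate` stands in for whatever function actually runs the model.

```python
from typing import Callable, Iterable, Tuple

# (resolution, steps) pairs from most to least demanding; values are illustrative.
FALLBACK_LADDER = [(1024, 30), (768, 25), (512, 20)]


def generate_with_fallback(generate: Callable[[int, int], object],
                           ladder: Iterable[Tuple[int, int]] = FALLBACK_LADDER):
    """Call generate(size, steps), stepping down the ladder on out-of-memory errors."""
    last_error = None
    for size, steps in ladder:
        try:
            return generate(size, steps), (size, steps)
        except RuntimeError as err:  # torch.cuda.OutOfMemoryError subclasses RuntimeError
            if "out of memory" not in str(err).lower():
                raise  # unrelated failure: do not hide it
            last_error = err
    raise RuntimeError("All fallback settings ran out of memory") from last_error
```

The function returns both the result and the settings that succeeded, so the caller can report that a smaller size was used.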

Authentication Errors:

Get a token from Hugging Face (Settings → Access Tokens)
Accept model licenses on Hugging Face (visit model pages)
Enter token in "🔐 Authentication" tab
Verify token has read permissions

Model Download Stuck:

Check internet connection stability
App includes automatic retry with exponential backoff
Monitor progress in Model Management tab
Large models can take 30-60 minutes on slow connections
Try downloading during off-peak hours
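
For context, "retry with exponential backoff" means waiting twice as long after each failed attempt. The sketch below shows the general technique; it is not the app's own retry code, and `retry`, `backoff_delays`, and the default delays are assumptions made for illustration.

```python
import random
import time
from typing import Callable, TypeVar

T = TypeVar("T")


def backoff_delays(retries: int, base: float = 2.0, cap: float = 60.0):
    """Delays in seconds before each retry: base, 2*base, 4*base, ... up to cap."""
    return [min(cap, base * 2**attempt) for attempt in range(retries)]


def retry(action: Callable[[], T], retries: int = 5, sleep=time.sleep) -> T:
    """Run action, retrying on OSError with exponential backoff plus jitter."""
    delays = backoff_delays(retries)
    for attempt in range(retries + 1):
        try:
            return action()
        except OSError:  # covers ConnectionError and TimeoutError
            if attempt == retries:
                raise  # out of retries: surface the last error
            # Random jitter keeps many clients from retrying in lockstep.
            sleep(delays[attempt] + random.uniform(0, 1))
```

With the defaults, a download that keeps failing is retried after roughly 2, 4, 8, 16, and 32 seconds before the error is reported.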

Model Won't Load:

Ensure model is fully downloaded (check Model Management tab)
Verify enough disk space in cache directory
Check GPU memory available (close other apps)
Restart application to clear cached memory
Try CPU mode if GPU memory insufficient

Helper Scripts

start.bat - Main launcher, handles all setup
setup_amd_gpu.py - AMD GPU detection and DirectML configuration
network_setup_helper.bat - Network and system diagnostics
setup_environment.bat - One-time Windows environment optimization

🎯 Tips for Best Results

Choosing the Right Model:

Quick iterations/artistic: Use SDXL (fastest, no auth)
Photorealistic portraits: Use Flux Schnell
Text/logos in images: Use SD3 Medium
Animate photos: Use Stable Video Diffusion
Text-to-video: Use AnimateDiff

Prompt Writing:

Be specific and detailed for Flux Schnell
Include art style for SDXL (e.g., "digital art", "oil painting")
Describe text content explicitly for SD3 Medium
Use negative prompts to avoid unwanted elements
Keep video prompts simple and focused

Performance Optimization:

Start with SDXL (fastest, no authentication)
Store models on SSD for faster loading
Lower steps (20-30) for faster generation during testing
Use GPU mode for best performance
Close unnecessary applications to free up VRAM

Image Quality:

Use 1024x1024 resolution for best results
Increase steps (50+) for higher quality
Adjust guidance scale (7-15) to control prompt adherence
Use same seed for reproducible results
Generate multiple variations to find best output

📁 Project Structure

Multi-Model-AI-Generator/
├── multi_model_generator.py       # Main application with all models
├── requirements.txt               # Python dependencies
├── start.bat                      # Universal Windows launcher
├── setup_amd_gpu.py               # AMD GPU setup utility
├── network_setup_helper.bat       # Network diagnostics
├── network_diagnostics.py         # Python network testing
├── setup_environment.bat          # Environment optimization
├── LICENSE                        # MIT License
└── generated_content/             # Output directory (auto-created)

🎨 Model Usage Guide

Flux Schnell:

✅ Photorealistic portraits and landscapes
✅ Complex scenes with multiple objects
✅ Professional-quality outputs
✅ Excellent prompt following
❌ Requires authentication
❌ Larger download size

SDXL:

✅ Fast generation for iteration
✅ Artistic styles and concept art
✅ Great for beginners (no auth)
✅ Versatile and reliable
❌ Less photorealistic than Flux

SD3 Medium:

✅ Best for text rendering in images
✅ Logos, signs, typography
✅ Technical illustrations
✅ High-quality output
❌ Requires authentication

Stable Video Diffusion:

✅ High-quality image animation
✅ Smooth motion effects
✅ Professional video quality
❌ Requires input image
❌ Limited to 4-second clips
❌ Slow generation

AnimateDiff:

✅ Text-to-video generation
✅ Character animations
✅ Creative storytelling
❌ Lower quality than Stable Video
❌ Simple motions work best

📋 Credits & Acknowledgments

AI Models:

Flux Schnell by Black Forest Labs
SDXL by Stability AI
Stable Diffusion 3 Medium by Stability AI
Stable Video Diffusion by Stability AI
AnimateDiff by Yuwei Guo et al.

Powered By:

🤗 Hugging Face Diffusers - Model pipelines
🎛️ Gradio - Web interface
🔥 PyTorch - Deep learning framework

🤝 Contributing

Contributions welcome! Please follow these steps:

Fork the repository
Create a feature branch (git checkout -b feature/AmazingFeature)
Commit your changes (git commit -m 'Add AmazingFeature')
Push to the branch (git push origin feature/AmazingFeature)
Open a Pull Request

📄 License

MIT License - see LICENSE file for details.

📞 Support

Issues: GitHub Issues
Repository: https://github.com/TriggeredBanana/Multi-Model-AI-Generator
Discussions: GitHub Discussions

Enjoy creating amazing AI-generated content! 🎨✨

