Media Studio

Media Studio is a local AI artist studio for images and videos.

It gives you a gallery, prompt box, source-image slot, model picker, presets, and local output history in one place. You run the app locally, keep your prompts and outputs on your own machine, and connect it to Kie AI for pay-as-you-go model access.

Kie AI, pronounced "key AI," is the external AI model marketplace and API platform behind Media Studio. It gives developers access to image, video, music, speech, and LLM models through one provider, and its public pages describe a credit-based pay-as-you-go system instead of a monthly subscription. Current Kie AI pages say entry-level purchases start at $5, and some model pages describe 1,000 credits for $5, with each model consuming its own amount of credits.

If you want the fastest path first:

START_HERE.md

What Is This About?

This repo is for people who want their own image and video studio instead of another hosted tool locked behind a subscription.

You run the dashboard locally, connect your own Kie AI key, and get:

your own gallery-style studio UI
your own presets and prompt workflows
your own queue, jobs, and output history
your own local files and artifacts

It is local-first by design. It is not trying to be a one-click hosted SaaS template.

Important: the dashboard and queue run locally, but the image and video models do not run on your machine. Media Studio sends generation jobs to Kie AI, which is the external model marketplace and provider used for live generation.

The local control API is intentionally private by default. The setup scripts generate a unique local control token and keep the Studio locked to localhost unless you explicitly configure browser credentials.

If you want to open Studio from a private LAN or Tailscale network without browser auth, set:

MEDIA_STUDIO_ALLOW_PRIVATE_NETWORK_ACCESS=true

That allows private-network access only. It does not open Studio to arbitrary public internet traffic.
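
To make the "private network only" idea concrete, here is a minimal sketch of the kind of address check such a flag implies. It is an illustration, not the code Media Studio actually runs: the function name is hypothetical, and the explicit 100.64.0.0/10 entry is there because Tailscale assigns addresses from that range, which Python's ipaddress module does not classify as private.

```python
import ipaddress

# Tailscale hands out addresses from the 100.64.0.0/10 carrier-grade NAT range.
# Python's is_private does not cover that range, so it is listed explicitly.
TAILSCALE_CGNAT = ipaddress.ip_network("100.64.0.0/10")


def is_client_allowed(client_ip: str, allow_private_network: bool) -> bool:
    """Hypothetical gate: loopback always passes, private ranges only when enabled."""
    addr = ipaddress.ip_address(client_ip)
    if addr.is_loopback:
        return True
    if not allow_private_network:
        return False
    # Public internet addresses fall through to False even with the flag set.
    return addr.is_private or addr in TAILSCALE_CGNAT
```

With the flag off, only loopback clients pass. With it on, LAN and Tailscale addresses pass, and a public address such as 8.8.8.8 is still rejected.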

Why Was It Built?

Most AI media tools make you rent the whole product just to access the models.

Media Studio was built to make that layer yours instead:

the app and workflow stay under your control
the backend is real Python, not a toy mock layer
the pricing stays usage-based through Kie AI
image and video generation live in one place

What Does It Use Under the Hood?

Next.js for the dashboard and browser-facing routes
FastAPI for the local control API
SQLite for jobs, batches, presets, queue state, and local metadata
kie-api for model registry, request validation, pricing, submit, polling, and artifacts
local filesystem storage for uploads, downloads, and generated outputs
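
As a rough picture of what "SQLite for jobs and queue state" can look like, the sketch below creates a tiny jobs table and reads the oldest queued job. The table name, columns, and status values are assumptions made for illustration. The real Media Studio schema lives in the repo and will differ.

```python
import sqlite3

# Hypothetical schema: the real Media Studio tables and columns may differ.
SCHEMA = """
CREATE TABLE IF NOT EXISTS jobs (
    id INTEGER PRIMARY KEY AUTOINCREMENT,
    model TEXT NOT NULL,
    prompt TEXT NOT NULL,
    status TEXT NOT NULL DEFAULT 'queued',
    created_at TEXT NOT NULL DEFAULT CURRENT_TIMESTAMP
);
"""


def open_db(path: str = ":memory:") -> sqlite3.Connection:
    """Open (or create) the database and make sure the jobs table exists."""
    conn = sqlite3.connect(path)
    conn.row_factory = sqlite3.Row  # allows row["column"] access
    conn.executescript(SCHEMA)
    return conn


def enqueue_job(conn: sqlite3.Connection, model: str, prompt: str) -> int:
    """Insert a new job in the 'queued' state and return its id."""
    cur = conn.execute(
        "INSERT INTO jobs (model, prompt) VALUES (?, ?)", (model, prompt)
    )
    conn.commit()
    return cur.lastrowid


def next_queued_job(conn: sqlite3.Connection):
    """Return the oldest queued job, or None when the queue is empty."""
    return conn.execute(
        "SELECT * FROM jobs WHERE status = 'queued' ORDER BY id LIMIT 1"
    ).fetchone()
```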

The main repo layout is:

apps/
  api/   FastAPI backend
  web/   Next.js frontend
scripts/
docs/
data/

Key supporting paths:

scripts/: local run, verification, and maintenance commands
docs/: operator and architecture notes
data/: local SQLite, uploads, downloads, generated outputs, and runtime files
sibling kie-api/ checkout: validation, pricing, submit, polling, and artifact publishing

What Goes Into The Shared Python Venv?

The setup flow creates one shared Python virtualenv in the sibling kie-api checkout.

The bootstrap step installs:

upgraded packaging tools: pip, setuptools, wheel
editable kie-api
editable media-studio-api
the apps/web workspace dependencies needed to run the local Next.js dashboard

That means the shared venv ends up with the local packages plus the API dependencies they declare, including:

fastapi
uvicorn[standard]
pydantic
python-multipart
httpx
Pillow
PyYAML

For normal app usage, the important runtime pieces are the two local packages plus the FastAPI stack. The packaging tools are there so editable installs work cleanly.

Test-only packages such as pytest and pytest-asyncio are installed later by the quality/release verification scripts, not by the basic onboarding path.

On the Node side, onboarding installs the web app workspace only. Test tooling such as Vitest and browser smoke tooling such as Playwright are part of the release/test path, not the normal user setup path.

What Provider Are We Using?

Right now the live generation path is Kie AI, pronounced "key AI."

That means:

you bring your own KIE_API_KEY
Media Studio uses the shared kie-api layer to talk to Kie AI
pricing, validation, and request normalization are driven from the Kie AI-backed registry
the models are executed remotely through Kie AI, not locally on your Mac, Linux box, or Windows machine

Kie AI is the model marketplace behind the app. It uses a credit-based, pay-as-you-go system instead of a monthly subscription.

As of April 3, 2026, Kie AI pages describe entry-level credit purchases starting at $5, and some current model pages cite 1,000 credits for $5. Different models consume different amounts of credits, so image and video jobs do not all cost the same. Check the provider site before making pricing promises, because Kie AI can change packs and pricing over time.
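
As a worked example of the arithmetic, 1,000 credits for $5 comes to $0.005 per credit. The helper below is only a sketch: the function name is made up, the defaults mirror the pack cited above, and the 20-credit job in the usage note is a hypothetical figure, not a real model price.

```python
from decimal import Decimal


def credits_to_usd(
    credits: int, pack_credits: int = 1000, pack_price_usd: str = "5"
) -> Decimal:
    """Convert a credit amount to dollars at a given pack rate."""
    price_per_credit = Decimal(pack_price_usd) / Decimal(pack_credits)
    return price_per_credit * Decimal(credits)
```

Under those assumptions, credits_to_usd(20) works out to $0.10 and credits_to_usd(1000) to $5. Actual per-model credit costs come from the provider and the /pricing page.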

Get your Kie AI key here:

kie.ai

How Does The System Work?

At a high level, the system works like this:

you browse the gallery and open the Studio composer
you choose a model, add a prompt, and optionally attach a source image
you can use a preset to fill in a repeatable workflow instead of starting from scratch
the local app validates and stores the job
Kie AI runs the model remotely and sends back the result
the finished output lands back in your gallery and local files

See docs/request-lifecycle.md for the full submit, queue, polling, publish, and retry lifecycle.

That is the main idea: local studio experience, remote model execution, local history.
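
The "submit, then poll until the remote job finishes" part of that flow can be sketched as a small loop. This is not the kie-api implementation. The status names, the interval, and the attempt limit are all assumptions, and fetch_status stands in for whatever call actually asks the provider for job state.

```python
import time
from typing import Callable

# Hypothetical terminal state names; the real provider statuses may differ.
TERMINAL_STATES = {"succeeded", "failed"}


def poll_until_done(
    fetch_status: Callable[[], str],
    interval_s: float = 2.0,
    max_attempts: int = 30,
    sleep: Callable[[float], None] = time.sleep,
) -> str:
    """Call fetch_status until it reports a terminal state or attempts run out."""
    for attempt in range(max_attempts):
        status = fetch_status()
        if status in TERMINAL_STATES:
            return status
        # Wait between attempts, but not after the final one.
        if attempt < max_attempts - 1:
            sleep(interval_s)
    return "timed_out"
```

Passing sleep as a parameter keeps the loop easy to test without real delays. A production version would likely add backoff and retry handling, as described in docs/request-lifecycle.md.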

What Models Are In The Studio Right Now?

Current image models:

nano-banana-2: General image generation and image editing. This is the default image model in the Studio.
nano-banana-pro: Higher-end Nano Banana variant for image generation and image editing.

Current video models:

kling-2.6-t2v: Text-to-video generation from a prompt only.
kling-2.6-i2v: Image-to-video generation from a single starting image.
kling-3.0-t2v: Newer Kling text-to-video flow.
kling-3.0-i2v: Newer Kling image-to-video flow, including first/last-frame style input handling in the Studio.
kling-3.0-motion: Motion-control workflow for guiding video movement from source media.

The exact pricing and request rules can change over time, so the app also exposes:

/pricing in the dashboard
GET /media/pricing in the control API
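
If you want to read the pricing endpoint from a script, a request could be built roughly like this. Only the /media/pricing path comes from this README. The base URL and port in the usage note, and the Authorization header used for the local control token, are assumptions, so check the repo for the actual values.

```python
import urllib.request
from typing import Optional


def build_pricing_request(
    base_url: str, token: Optional[str] = None
) -> urllib.request.Request:
    """Build a GET request for the control API pricing endpoint."""
    url = base_url.rstrip("/") + "/media/pricing"
    req = urllib.request.Request(url, method="GET")
    if token:
        # Hypothetical header: the real control token header may be different.
        req.add_header("Authorization", f"Bearer {token}")
    return req
```

For example, build_pricing_request("http://127.0.0.1:8000") could be passed to urllib.request.urlopen while the local API is running, assuming it listens on that port.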

Nano Banana And Presets

Nano Banana is the core image workflow in the Studio right now.

The app currently ships with:

nano-banana-2 as the default image model
nano-banana-pro as the higher-end image variant
shared built-in Nano Banana presets to show how guided image workflows work out of the box

The preset system is one of the best parts of the product.

A preset is not just a saved prompt. A preset can define:

which models it applies to
a reusable prompt template
structured text inputs like names, characters, scenes, or style fields
required image slots such as a portrait or reference image
default options that should be applied automatically
thumbnails, notes, and model-specific guidance

That means you can build repeatable workflows instead of rewriting the same prompt every time.
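
One way to picture a preset is as a small data structure that knows its template and its required inputs. The sketch below is illustrative only: the class, field names, and the template text are invented here and do not reflect the app's stored preset format.

```python
from dataclasses import dataclass, field
from string import Template


@dataclass
class Preset:
    """Hypothetical preset shape: template plus the inputs it requires."""
    name: str
    models: list[str]
    prompt_template: str
    text_inputs: list[str] = field(default_factory=list)
    image_slots: list[str] = field(default_factory=list)
    default_options: dict[str, str] = field(default_factory=dict)

    def render(self, values: dict[str, str], images: dict[str, str]) -> str:
        """Fill the template, refusing to run when a required input is missing."""
        missing = [k for k in self.text_inputs if not values.get(k)]
        missing += [s for s in self.image_slots if s not in images]
        if missing:
            raise ValueError(f"missing preset inputs: {', '.join(missing)}")
        return Template(self.prompt_template).substitute(values)


# Illustrative instance; the model list and template wording are assumptions.
selfie = Preset(
    name="Selfie with Movie Character",
    models=["nano-banana-2", "nano-banana-pro"],
    prompt_template="A selfie of the person in the photo with $actor from $movie.",
    text_inputs=["actor", "movie"],
    image_slots=["portrait"],
)
```

Calling selfie.render with an actor, a movie, and a portrait image produces the final prompt, while leaving out the portrait raises an error before any job is submitted.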

In practice, presets make the studio feel more like a small creative tool than a raw API front end.

Examples already seeded into the app:

3D Caricature Style: Upload a portrait and turn it into a stylized 3D caricature.
Selfie with Movie Character: Upload your photo, add an actor and movie name, and generate a guided selfie-style composition.

How Do I Set This Up Quickly?

Minimum requirements:

git
python3
Node.js LTS (includes npm)
KIE_API_KEY from Kie AI

If you need install help first:

docs/prerequisites.md

macOS

If Git is missing, install Apple Command Line Tools first:

xcode-select --install

If Node.js is missing, install the current LTS release from:

nodejs.org

Then clone the repo and run the macOS onboarding script:

git clone https://github.com/gateway/media-studio.git
cd media-studio
./scripts/onboard_mac.sh

That script:

clones or reuses the shared kie-api repo
creates the shared Python virtualenv
installs Python and web dependencies
creates .env
creates a clean local database
prompts for KIE_API_KEY
asks whether you want to enable optional prompt enhancement now
lets you skip prompt enhancement and add it later in Settings
creates a simple Mac start/stop flow for normal users
checks for existing local Studio processes or blocked ports before launching
can open Media Studio for you immediately when setup finishes

After setup, the easiest way to reopen the app later is to double-click Start Media Studio.command.

On macOS, Start Media Studio.command uses one launcher Terminal window, starts both the API and web processes in production mode, waits for Studio to be ready, and opens the browser directly to /studio.

If you accidentally close that launcher window or something gets stuck, the easiest recovery path is still:

double-click Stop Media Studio.command
double-click Start Media Studio.command

The Mac launcher also tries to recover automatically if only part of the local app is still running.

If you prefer the same production-style local run from Terminal, use:

./scripts/run_studio_mac.sh

Open:

http://127.0.0.1:3000/studio

Private LAN / Tailscale access

If you want to open Studio from your phone or another device on your private network, add this to .env:

MEDIA_STUDIO_ALLOW_PRIVATE_NETWORK_ACCESS=true

Then restart Studio.

For Tailscale or LAN access in development mode, also set:

MEDIA_STUDIO_WEB_HOST=0.0.0.0

The browser auth alternative is still:

MEDIA_STUDIO_ADMIN_USERNAME=your_user
MEDIA_STUDIO_ADMIN_PASSWORD=your_password

Developer mode

If you are actively changing the code and want hot reload, use the dev scripts instead:

npm run dev:api
./scripts/dev_web.sh

That path is for development only. It runs the web app in Next.js dev mode, so you may see dev-only UI such as the Next badge or debug overlays.

Media Studio will still route you to the right first page:

first run or incomplete setup -> /setup
ready system with models loaded -> /studio

Getting to your first generation takes only a few steps:

create a Kie AI account and get your KIE_API_KEY
add that key during setup
start Media Studio
open /studio
choose a model or preset
submit your first job

The shortest version is:

Get a Kie AI key.
Run the setup script.
Open the Studio.
Pick a model.
Prompt and generate.

Prompt enhancement setup during onboarding now focuses on the hosted OpenRouter path. If you ever want to use a local OpenAI-compatible enhancer instead, you can switch that later in Settings.

Linux

The onboarding script above only supports macOS. For Linux, use the shared bootstrap directly:

git clone https://github.com/gateway/media-studio.git
cd media-studio
./scripts/bootstrap_local.sh

Then add your KIE_API_KEY to .env and run:

npm run dev:api
./scripts/dev_web.sh

Windows

git clone https://github.com/gateway/media-studio.git
cd media-studio
powershell -ExecutionPolicy Bypass -File .\scripts\onboard_windows.ps1

Detailed setup docs:

Prompt Enhancement

Prompt enhancement is optional.

You can use Media Studio with only KIE_API_KEY and nothing else.

If you want prompt rewriting or enhancement before generation, you can also configure:

OPENROUTER_API_KEY for hosted prompt enhancement
MEDIA_LOCAL_OPENAI_BASE_URL for a local OpenAI-compatible endpoint
MEDIA_LOCAL_OPENAI_API_KEY if that local endpoint requires auth

The macOS onboarding flow asks if you want to enable prompt enhancement now, verifies the OpenRouter key if you do, and lets you skip the whole step and add it later in Settings.

By default, the recommended OpenRouter enhancement model is qwen/qwen3.5-35b-a3b, and the enhancement layer can also work with supported multimodal models when you want image-aware prompt help.

These are helpers for prompt quality. They are not required for the core image or video generation flow.
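
To show how those three variables relate, here is a small sketch that picks an enhancer from an environment mapping. The function name, the returned dictionary shape, and the precedence (hosted OpenRouter first, then the local endpoint) are assumptions. The app's actual selection is controlled in Settings.

```python
from typing import Mapping, Optional


def pick_enhancer(env: Mapping[str, str]) -> Optional[dict]:
    """Return a hypothetical enhancer config, or None when enhancement is off."""
    if env.get("OPENROUTER_API_KEY"):
        # Model id is the recommended default named in this README.
        return {"provider": "openrouter", "model": "qwen/qwen3.5-35b-a3b"}
    base_url = env.get("MEDIA_LOCAL_OPENAI_BASE_URL")
    if base_url:
        return {
            "provider": "local",
            "base_url": base_url,
            # Optional: only needed when the local endpoint requires auth.
            "api_key": env.get("MEDIA_LOCAL_OPENAI_API_KEY"),
        }
    return None
```

With only KIE_API_KEY set, this returns None, which matches the point above: generation works without any enhancer configured.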

Miscellaneous Things To Know

The app is local-first and works best as a localhost studio.
The shared Python runtime lives in the sibling kie-api checkout, so Media Studio does not need its own separate Python venv.
The setup flow supports both ../kie-api and legacy ../kie-ai/kie_codex_bootstrap layouts.
If you skip KIE_API_KEY during setup, the app still installs, but live generation stays off until you add the key.
The local app stores prompts, jobs, and output files on your machine, but the actual model generation happens through Kie AI.
Runtime files such as databases, downloads, uploads, and outputs stay local and should not be committed.

For deeper runtime details:

docs/runtime-and-supervision.md

More Docs

If you are a person setting this up for the first time:

START_HERE.md

If you are pointing an LLM or another helper at the project and want the fastest onboarding context:
