Pruna endpoints

Pruna is a model laboratory for production AI. We create our own performance models (below) and host optimized models for you on Pruna and on partner inference platforms such as Replicate, Prodia, Runpod, Segmind, DeepInfra, Wiro, WaveSpeed, inference.sh, Eachlabs, Scenario, and Runway. These endpoints are optimized with Pruna for speed, efficiency, and cost, which makes them the fastest and most efficient way to run your models in production.

Performance models

What are performance models?

Performance models come out of our model laboratory: models we create, optimize, host, and serve ourselves, tuned for the Pareto front of speed, cost, and quality. We also optimize well-known open-source models and offer them as serverless endpoints for production use. Our own performance models include P-Image, P-Image-Edit, P-Image-Upscale, P-Video, and P-Video-Avatar. Get API access via the Pruna User Portal.

P-Image

A performance text-to-image model delivering AI images in under one second, combining speed, quality, prompt adherence, and reliable text rendering.

P-Image-Edit

A state-of-the-art image editing model, offering fast, high-quality multi-image editing with excellent prompt following and text rendering.

P-Image-Try-On

Virtual garment try-on: dress a person photo from flat-lay, product, worn, or multi-garment references — up to eleven categories per request — while preserving face, pose, and background.

P-Image-Upscale

General quality improvements for finished generations and edits: sharper details, cleaner text, and a more polished final image without changing the composition.

P-Video

A performance video generation model delivering state-of-the-art AI video in seconds, with support for long-form generation, image references, and audio syncing.

P-Video-Avatar

A performance avatar video model that generates talking spokesperson videos from one image using scripts or uploaded audio, with multilingual voice support.

P-Video-Animate

Animate a single image with motion, timing, and camera movement taken from a reference video: fast, cost-efficient motion transfer at 720p and 1080p.

P-Video-Replace

Replace characters in an existing video while preserving motion, camera, and scene structure: a fast, affordable character swap on the p-video-replace endpoint.

LoRA Training and Inference

P-Image LoRA and P-Image-Edit LoRA: Training and Inference (two workflows).

Sign Up

Get API access. Create an account and run inference via the Pruna User Portal.

https://dashboard.pruna.ai/login

API Reference

API Reference for the performance models.

https://docs.api.pruna.ai/guides/quickstart

All Models & Pricing

The full list of available models and their pricing.

https://www.pruna.ai/all-models

Why performance models?

Pruna endpoints offer significant advantages over building and running your own model endpoints from scratch, thanks to our integrated optimizations and cloud infrastructure partnerships. Our performance models are optimized for the Pareto front of speed, cost, and quality, which makes them:

Faster: Models are hosted and optimized for speed using the latest optimization algorithms.
Cheaper: Model optimizations reduce hardware requirements and, in turn, costs.
Better: Good optimizations can be lossless and those are our specialty.
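
To make the "Pareto front of speed, cost, and quality" concrete, here is a minimal sketch of how such a front can be computed for a set of candidate model variants. The candidate names and numbers are made up for illustration and are not measurements of any Pruna model; the `Candidate`, `dominates`, and `pareto_front` names are hypothetical helpers, not part of any Pruna API.

```python
from typing import NamedTuple


class Candidate(NamedTuple):
    name: str
    latency_s: float  # lower is better
    cost_usd: float   # lower is better
    quality: float    # higher is better


def dominates(a: Candidate, b: Candidate) -> bool:
    """True if a is at least as good as b on every axis and strictly better on one."""
    no_worse = (
        a.latency_s <= b.latency_s
        and a.cost_usd <= b.cost_usd
        and a.quality >= b.quality
    )
    strictly_better = (
        a.latency_s < b.latency_s
        or a.cost_usd < b.cost_usd
        or a.quality > b.quality
    )
    return no_worse and strictly_better


def pareto_front(candidates: list[Candidate]) -> list[Candidate]:
    """Keep every candidate that no other candidate dominates."""
    return [
        c for c in candidates
        if not any(dominates(other, c) for other in candidates)
    ]


# Illustrative numbers only (assumption): not benchmarks of real endpoints.
candidates = [
    Candidate("baseline", latency_s=4.0, cost_usd=0.020, quality=0.90),
    Candidate("optimized-a", latency_s=1.0, cost_usd=0.005, quality=0.90),
    Candidate("optimized-b", latency_s=0.8, cost_usd=0.004, quality=0.85),
    Candidate("over-compressed", latency_s=1.2, cost_usd=0.006, quality=0.70),
]

front_names = [c.name for c in pareto_front(candidates)]
print(front_names)
```

In this toy data, a variant stays on the front only if no other variant matches or beats it on all three axes at once. A lossless optimization in this picture is one that improves latency or cost while leaving quality unchanged.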

Tip

Check out our benchmark comparison page for a head-to-head look at latency and price compared to self-hosting and other public endpoints. See how much time and money you can save.

General guides

Guides for workflows that span several models:

Fix AI Bias in 5 Minutes

Learn to identify and correct bias in AI-generated images using p-image and p-image-edit.

Dealing with Bias and Diversity in media generation

Turn Any Image Into a Video in 60 Seconds

Generate with p-image, optionally refine with p-image-edit, then animate with p-video.

How to Improve P-Video with a First Frame from P-Image
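
The generate, optionally refine, then animate flow described above can be sketched as a small pipeline. This is only an outline under stated assumptions: the three steps are passed in as plain callables, and the stub functions below stand in for real calls to p-image, p-image-edit, and p-video. The function names, signatures, and string "references" are hypothetical and do not reflect the actual Pruna API; see the API Reference for real request formats.

```python
from typing import Callable, Optional

# Each step is a plain callable so real API clients can be plugged in later.
GenerateImage = Callable[[str], str]      # prompt -> image reference
EditImage = Callable[[str, str], str]     # image reference, instruction -> image reference
AnimateImage = Callable[[str, str], str]  # image reference, motion prompt -> video reference


def image_to_video(
    prompt: str,
    motion_prompt: str,
    generate: GenerateImage,
    animate: AnimateImage,
    edit: Optional[EditImage] = None,
    edit_instruction: Optional[str] = None,
) -> str:
    """Generate a first frame, optionally edit it, then animate it."""
    first_frame = generate(prompt)
    # The edit step is skipped unless both an editor and an instruction are given.
    if edit is not None and edit_instruction:
        first_frame = edit(first_frame, edit_instruction)
    return animate(first_frame, motion_prompt)


# Stub steps (assumption): they only build strings so the flow can run offline.
def fake_generate(prompt: str) -> str:
    return f"image({prompt})"


def fake_edit(image: str, instruction: str) -> str:
    return f"edited({image}, {instruction})"


def fake_animate(image: str, motion: str) -> str:
    return f"video({image}, {motion})"


with_edit = image_to_video(
    "a red fox in snow", "slow dolly-in",
    fake_generate, fake_animate, fake_edit, "add falling snow",
)
without_edit = image_to_video(
    "a red fox in snow", "slow dolly-in", fake_generate, fake_animate,
)
print(with_edit)
print(without_edit)
```

Keeping the steps as injected callables means the same control flow works whether the image comes straight from generation or passes through an edit first.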

Create a Music Video in 10 Minutes

From lyrics to full music video with MiniMax Music 1.5, p-image, p-image-edit, and p-video.

How to Generate an AI Music Video with P-Video and MiniMax Music

Generate a Long AI Movie Using Scene Chaining

Use an LLM to craft prompts, then build images, edits, and video segments.

How to create a movie from one idea using scene chaining

Automated Prompt Engineering with VLMs and DSPy

Iteratively refine image prompts until the output matches your intent.

How to refine prompts until the image matches your intent
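
The idea of refining a prompt until the image matches your intent can be sketched as a generate, score, revise loop. This sketch does not use DSPy or a real VLM: the `generate`, `score`, and `revise` callables are hypothetical placeholders, and the stubs below are simple string functions chosen so the loop runs offline. In a real setup, scoring would presumably come from a VLM judging the image against the intent.

```python
from typing import Callable


def refine_until_match(
    prompt: str,
    generate: Callable[[str], str],
    score: Callable[[str], float],
    revise: Callable[[str, str, float], str],
    threshold: float = 0.8,
    max_rounds: int = 5,
) -> tuple[str, str, float]:
    """Return the best (prompt, image, score) seen within max_rounds."""
    if max_rounds < 1:
        raise ValueError("max_rounds must be at least 1")
    best = None
    for _ in range(max_rounds):
        image = generate(prompt)
        current = score(image)
        if best is None or current > best[2]:
            best = (prompt, image, current)
        if current >= threshold:
            break  # good enough: stop spending generations
        prompt = revise(prompt, image, current)
    return best


# Stubs (assumption): the "intent" is that three words show up in the image.
REQUIRED = ("fox", "snow", "sunset")


def fake_generate(prompt: str) -> str:
    return f"image of {prompt}"


def fake_score(image: str) -> float:
    return sum(word in image for word in REQUIRED) / len(REQUIRED)


def fake_revise(prompt: str, image: str, score: float) -> str:
    # Add the first required word that the image is still missing.
    missing = next(word for word in REQUIRED if word not in image)
    return f"{prompt}, {missing}"


final_prompt, final_image, final_score = refine_until_match(
    "a fox", fake_generate, fake_score, fake_revise, threshold=1.0
)
print(final_prompt, final_score)
```

The `max_rounds` cap bounds the number of generations, and tracking the best result means the loop still returns something useful if the threshold is never reached.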

Build Multilingual Avatar Videos End-to-End

Build realistic spokesperson videos with p-video-avatar, including p-image start frames, voice variants, multilingual outputs, and cost-aware configuration presets.

P-Video-Avatar end-to-end

P-Image LoRA Training and Inference

A guide for P-Image LoRA Training and Inference (p-image-trainer, p-image-lora).

P-Image LoRA: Training and Inference

P-Image-Edit LoRA Training and Inference

A guide for P-Image-Edit LoRA Training and Inference (p-image-edit-trainer, p-image-edit-lora).

P-Image-Edit LoRA: Training and Inference

P-Image Upscaling: Quality Improvements

Learn how to use p-image-upscale for general quality improvements, then apply it as a final polish pass after generation or editing.

Use P-Image-Upscale for General Quality Improvements and Final Polish

Prompting guides

Quick reference one-sheet guides for immediate use:

Cheatsheet

Get a quick overview of all the modalities and their overall prompt structure.

Image generation

Learn all the nuances of image generation prompts.

Image Generation

Image editing

Learn all the nuances of image editing prompts.

Image Editing

Video generation

Learn all the nuances of video generation prompts.

Video Generation