Pruna endpoints

pruna hosts serverless endpoints for your models in collaboration with the inference providers in the industry. These endpoints are optimized for speed, efficiency, and cost with Pruna. This makes them the fastest and most efficient way to run your models in production.

Performance models

What are performance models?

Performance models are Pruna’s own optimized models that we host and serve. We optimize them for the Pareto front of speed, cost, and quality. We optimize and offer as serverless well-known open source models for production use. Additionally, we have our own performance models like P-Image, P-Image-Edit, and P-Video. Get API access via the Pruna User Portal.

P-Image: text-to-image
P-Image — Text-to-image
P-Image-Edit before P-Image-Edit after
P-Image-Edit — Image editing
P-Video — Video generation
P-Image

A performance text-to-image model delivering AI images in under one second, combining speed, quality, prompt adherence, and reliable text rendering.

P-Image
P-Image-Edit

A state-of-the-art image editing model, offering fast, high-quality multi-image editing with excellent prompt following and text rendering.

P-Image-Edit
P-Video

A performance video generation model delivering state-of-the-art AI video in seconds, with support for long-form generation, image references, and audio syncing.

P-Video
LoRA Training and Inference

P-Image LoRA and P-Image-Edit LoRA: Training and Inference (two workflows).

LoRA Training and Inference
Sign Up

Get API access. Create an account and run inference via the Pruna User Portal.

https://dashboard.pruna.ai/login
API Reference

API Reference for the performance models.

https://docs.api.pruna.ai/guides/quickstart
All Models & Pricing

All models and pricing for our models.

https://www.pruna.ai/all-models

Why performance models?

Pruna endpoints offer significant advantages over running your own model endpoints from scratch — thanks to our integrated optimizations and cloud infrastructure partnerships. Our performance models are optimized for the Pareto front of speed, cost, and quality. They provide:

  • Faster: Models are hosted and optimized for speed using the latest optimization algorithms.

  • Cheaper: Model optimizations reduce hardware requirements and reduce costs.

  • Better: Good optimizations can be lossless and those are our specialty.

Tip

Check out our benchmark comparison page for a head-to-head look at latency and price compared to self-hosting and other public endpoints. See how much time and money you can save.

General guides

Guides for overlapping topics:

Fix AI Bias in 5 Minutes

Learn to identify and correct bias in AI-generated images using p-image and p-image-edit.

Dealing with Bias and Diversity in media generation
Turn Any Image Into a Video in 60 Seconds

Generate with p-image, optionally refine with p-image-edit, then animate with p-video.

How to Improve P-Video with a First Frame from P-Image
Create a Music Video in 10 Minutes

From lyrics to full music video with MiniMax Music 1.5, p-image, p-image-edit, and p-video.

Create a Music Video in 10 Minutes (No Editing Skills)
Generate a Long AI Movie Using Scene Chaining

Use an LLM to craft prompts, then build images, edits, and video segments.

How to create a movie from one idea using scene chaining
Automated Prompt Engineering with VLMs and DSPy

Iteratively refine image prompts until the output matches your intent.

How to refine prompts until the image matches your intent
P-Image LoRA Training and Inference

A guide for P-Image LoRA Training and Inference (p-image-trainer, p-image-lora).

P-Image LoRA: Training and Inference
P-Image-Edit LoRA Training and Inference

A guide for P-Image-Edit LoRA Training and Inference (p-image-edit-trainer, p-image-edit-lora).

P-Image-Edit LoRA: Training and Inference

Prompting guides

Quick reference one-sheet guides for immediate use:

Cheatsheet

Get a quick overview of all the modalities and their overall prompt structure.

Cheatsheet
Image generation

Learn all the nuances of image generation prompts.

Image Generation
Image editing

Learn all the nuances of image editing prompts.

Image Editing
Video generation

Learn all the nuances of video generation prompts.

Video Generation