Image, Video & Design

Generative art, video synthesis, design AI

96 tools · All-time leaderboard

← All Categories

What is Image, Video & Design?

Image, Video & Design covers 96 AI tools, with the top 10 averaging 65,475 community votes. 58 of the tools here are open source or have significant community traction. This page ranks them by all-time community votes, not by paid placement.

The current top three: yt-dlp, transformers, stable-diffusion. Each entry below shows the tool, its open-source stars or community size, and a short description from the project's own README. Click through to a full review for pricing, alternatives, and what it's actually good at.

Looking for something specific? Try the AI tool search engine — it indexes every tool on saas.pet and will surface what fits your workflow, not just what has the most votes.

🏆 Top 30 in Image, Video & Design

yt-dlp

A feature-rich command-line audio/video downloader

★ 171,312 votes

transformers

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

★ 161,696 votes

stable-diffusion

A latent text-to-image diffusion model

★ 73,127 votes

LocalAI

LocalAI is the open-source AI engine. Run any model - LLMs, vision, voice, image, video - on any hardware. No GPU required.

★ 48,082 votes

bark

🔊 Text-Prompted Generative Audio Model

★ 39,161 votes

awesome-system-design-resources

Learn System Design concepts and prepare for interviews using free resources.

★ 38,821 votes

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

★ 33,878 votes

CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

★ 33,799 votes

repomix

📦 Repomix is a powerful tool that packs your entire repository into a single, AI-friendly file. Perfect for when you need to feed your codebase to Large Language Models (LLMs) or other AI tools like ...

★ 27,553 votes

awesome-generative-ai-guide

A one stop repository for generative AI research updates, interview resources, notebooks and much more!

★ 27,324 votes

generative-models

Generative Models by Stability AI

★ 27,193 votes

peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

★ 21,279 votes

NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

★ 17,400 votes

OpenMontage

World's first open-source, agentic video production system. 12 pipelines, 52 tools, 500+ agent skills. Turn your AI coding assistant into a full video production studio.

★ 15,353 votes

latent-diffusion

High-Resolution Image Synthesis with Latent Diffusion Models

★ 14,067 votes

awesome-generative-ai

A curated list of modern Generative Artificial Intelligence projects and services

★ 12,156 votes

DALL-E

PyTorch package for the discrete VAE used for DALL·E.

★ 10,851 votes

hallmark

Anti-AI-slop design skill for Claude Code, Cursor, and Codex.

★ 9,193 votes

BentoML

The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!

★ 8,747 votes

jukebox

Code for the paper "Jukebox: A Generative Model for Music"

★ 8,037 votes

design.md

A format specification for describing a visual identity to coding agents. DESIGN.md gives agents a persistent, structured understanding of a design system.

★ 7,524 votes

system_prompts_leaks

Extracted system prompts from Anthropic - Claude Fable 5, Opus 4.8, Claude Code, Claude Design. OpenAI - ChatGPT 5.5 Thinking, GPT 5.5 Instant, Codex. Google - Gemini 3.5 Flash, 3.1 Pro, Antigravity. ...

★ 7,149 votes

bifrost

Fastest enterprise AI gateway (50x faster than LiteLLM) with adaptive load balancer, cluster mode, guardrails, 1000+ models support & <100 µs overhead at 5k RPS.

★ 6,952 votes

harness-sdk

Build an agent harness and control it end-to-end. Open-source SDK for production AI agents in Python & TypeScript - any model, any cloud.

★ 6,759 votes

simplex-chat

SimpleX - the first messaging network operating without user identifiers of any kind - 100% private by design! iOS, Android and desktop apps 📱!

★ 5,995 votes

video-use

Edit videos with coding agents

★ 4,288 votes

claude-video

Give Claude the ability to watch any video. /watch downloads, extracts frames, transcribes, hands it all to Claude.

★ 4,128 votes

astryx

An open source design system that's fully customizable and agent ready

★ 4,087 votes

RuView

π RuView turns commodity WiFi signals into real-time spatial intelligence, vital sign monitoring, and presence detection — all without a single pixel of video.

★ 3,537 votes

awesome-generative-ai

A curated list of Generative AI tools, works, models, and references

★ 3,480 votes