Image, Video & Design

Generative art, video synthesis, design AI

35 tools · All-time leaderboard

πŸ† Top 30 in Image, Video & Design

🥇
GitHub

yt-dlp

A feature-rich command-line audio/video downloader

★ 171,312 votes
💬 0
🥈
GitHub

transformers

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

★ 161,696 votes
💬 0
🥉
GitHub

stable-diffusion

A latent text-to-image diffusion model

★ 73,127 votes
💬 0
#4
GitHub

bark

🔊 Text-Prompted Generative Audio Model

★ 39,161 votes
💬 0
#5
GitHub

awesome-system-design-resources

Learn System Design concepts and prepare for interviews using free resources.

★ 38,821 votes
💬 0
#6
GitHub

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

★ 33,878 votes
💬 0
#7
GitHub

CLIP

CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image

★ 33,799 votes
💬 0
#8
GitHub

awesome-generative-ai-guide

A one stop repository for generative AI research updates, interview resources, notebooks and much more!

★ 27,324 votes
💬 0
#9
GitHub

generative-models

Generative Models by Stability AI

★ 27,193 votes
💬 0
#10
GitHub

peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

★ 21,279 votes
💬 0
#11
GitHub

NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

★ 17,400 votes
💬 0
#12
GitHub

latent-diffusion

High-Resolution Image Synthesis with Latent Diffusion Models

★ 14,067 votes
💬 0
#13
GitHub

awesome-generative-ai

A curated list of modern Generative Artificial Intelligence projects and services

★ 12,156 votes
💬 0
#14
GitHub

DALL-E

PyTorch package for the discrete VAE used for DALL·E.

★ 10,851 votes
💬 0
#15
GitHub

jukebox

Code for the paper "Jukebox: A Generative Model for Music"

★ 8,037 votes
💬 0
#16
GitHub

awesome-generative-ai

A curated list of Generative AI tools, works, models, and references

★ 3,480 votes
💬 0
#17
GitHub

chatwoot

Open-source live-chat, email support, omni-channel desk. An alternative to Intercom, Zendesk, Salesforce Service Cloud etc. 🔥💬

★ 2,166 votes
💬 1193
📅 2d
#18
GitHub

system_prompts_leaks

Extracted system prompts from Anthropic - Claude Fable 5, Opus 4.8, Claude Code, Claude Design. OpenAI - ChatGPT 5.5 Thinking, GPT 5.5 Instant, Codex. Google - Gemini 3.5 Flash, 3.1 Pro, Antigravity. ...

★ 1,220 votes
💬 0
📅 1d
#19
GitHub

pixel2motion

AI logo animation skill: turn raster logos into smooth SVG animation, animated HTML demos, GIF/video previews, and motion QA evidence.

★ 787 votes
💬 4
📅 2d
#20
Product Hunt

Framer 3.0

Agents bring AI to the canvas to help you design, write, analyze, and organize your sites. We’re also launching Branching, a new way for teams to explore ideas before they go live, and unveiling the n...

★ 386 votes
💬 17
📅 1d
#21
Product Hunt

Agentic videos by D-ID

Turn any video into an interactive AI experience. With Agentic Videos, viewers don't just watch - they pause, ask questions, get real-time answers, and interact with the presenter inside the video...

★ 156 votes
💬 52
📅 1d
#22
Product Hunt

Dualora

Multi-platform creators have a broken workflow: record landscape for YouTube, flip the phone, record again for Shorts. Or record once and spend 20 minutes cropping a second version that still looks of...

★ 107 votes
💬 12
📅 1d
#23
Product Hunt

Buddy

Buddy is the most powerful AI design agent inside Figma. And if you already pay for ChatGPT, plug it in and chat for free. No AI credits. Generate screens, flows, and variants on your canvas. Clone an...

★ 107 votes
💬 12
📅 1d
#24
Product Hunt

memi

A macOS workbench where Claude, Codex, and Hermes run on your specs, research, and Figma files.

★ 99 votes
💬 7
📅 1d
#25
Product Hunt

Locofy: design-to-code agents

Locofy is the agentic frontend layer between Figma and Cursor & Claude. Using proprietary design-aware models combined with LLMs, Locofy converts Figma designs into structured, responsive, pixel-p...

★ 96 votes
💬 8
📅 1d
#26
GitHub

penpot

Penpot: The open-source design tool for design and code collaboration

★ 94 votes
💬 0
📅 1d
#27
Product Hunt

Ploy.ai

Ploy is an agentic marketing platform that powers all your digital channels, starting with your website. It slurps your site, designs pages, runs ad campaigns, personalizes content, and syncs data bac...

★ 93 votes
💬 3
📅 1d
#28
Product Hunt

Wolfram Language 15

New and updated functionality in Wolfram Language 15: LLM & AI, notebook & user interfaces, symbolic & numeric computations, visualization & graphics, geometry & graphs, astronomy,...

★ 92 votes
💬 1
📅 1d
#29
Product Hunt

Redactify

Bleep out swear words in your videos automatically, and blur speakers' mouths so nothing slips through. Built for creators, podcasters, and media teams shipping to TikTok, YouTube, Twitch and Spot

★ 91 votes
💬 7
📅 1d
#30
Product Hunt

ClipDone

Creating video content still takes too much work. The best marketing videos usually come from real people speaking to a camera, but manual editing is slow and expensive. It shouldn’t be easier to gene...

★ 86 votes
💬 2
📅 1d