A feature-rich command-line audio/video downloader
π€ Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
A latent text-to-image diffusion model
π Text-Prompted Generative Audio Model
Learn System Design concepts and prepare for interviews using free resources.
π€ Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
A one stop repository for generative AI research updates, interview resources, notebooks and much more!
Generative Models by Stability AI
π€ PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
High-Resolution Image Synthesis with Latent Diffusion Models
A curated list of modern Generative Artificial Intelligence projects and services
PyTorch package for the discrete VAE used for DALLΒ·E.
Code for the paper "Jukebox: A Generative Model for Music"
A curated list of Generative AI tools, works, models, and references
Open-source live-chat, email support, omni-channel desk. An alternative to Intercom, Zendesk, Salesforce Service Cloud etc. π₯π¬
Extracted system prompts from Anthropic - Claude Fable 5, Opus 4.8, Claude Code, Claude Design. OpenAI - ChatGPT 5.5 Thinking, GPT 5.5 Instant, Codex. Google - Gemini 3.5 Flash, 3.1 Pro, Antigravity. ...
AI logo animation skill: turn raster logos into smooth SVG animation, animated HTML demos, GIF/video previews, and motion QA evidence.
Agents bring AI to the canvas to help you design, write, analyze, and organize your sites. Weβre also launching Branching, a new way for teams to explore ideas before they go live, and unveiling the n...
Turn any video into an interactive AI experience. With Agentic Videos, viewers don't just watch - they pause, ask questions, get real-time answers, and interact with the presenter inside the video...
Multi-platform creators have a broken workflow: record landscape for YouTube, flip the phone, record again for Shorts. Or record once and spend 20 minutes cropping a second version that still looks of...
Buddy is the most powerful AI design agent inside Figma. And if you already pay for ChatGPT, plug it in and chat for free. No AI credits. Generate screens, flows, and variants on your canvas. Clone an...
A macOS workbench where Claude, Codex, and Hermes run on your specs, research, and Figma files.
Locofy is the agentic frontend layer between Figma and Cursor & Claude. Using proprietary design-aware models combined with LLMs, Locofy converts Figma designs into structured, responsive, pixel-p...
Penpot: The open-source design tool for design and code collaboration
Ploy is an agentic marketing platform that powers all your digital channels, starting with your website. It slurps your site, designs pages, runs ad campaigns, personalizes content, and syncs data bac...
New and updated functionality in Wolfram Language 15: LLM & AI, notebook & user interfaces, symbolic & numeric computations, visualization & graphics, geometry & graphs, astronomy,...
Bleep out swear words in your videos automatically, and blur speakers' mouths so nothing slips through. Built for creators, podcasters, and media teams shipping to TikTok, YouTube, Twitch and Spot
Creating video content still takes too much work. The best marketing videos usually come from real people speaking to a camera, but manual editing is slow and expensive. It shouldnβt be easier to gene...