Data, ML & Research

Training, fine-tuning, datasets, MLOps

45 tools · All-time leaderboard

🏆 Top 30 in Data, ML & Research

🥇
GitHub

tensorflow

An Open Source Machine Learning Framework for Everyone

★ 195,729 votes
💬 0
🥈
GitHub

ollama

Get up and running with Kimi-K2.6, GLM-5.1, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.

★ 174,448 votes
💬 0
🥉
GitHub

transformers

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

★ 161,696 votes
💬 0
#4
GitHub

llama.cpp

LLM inference in C/C++

★ 117,132 votes
💬 0
#5
GitHub

pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

★ 100,805 votes
💬 0
#6
GitHub

scikit-learn

scikit-learn: machine learning in Python

★ 66,360 votes
💬 0
#7
GitHub

keras

Deep Learning for humans

★ 64,096 votes
💬 0
#8
GitHub

openinterpreter

A lightweight coding agent for open models like Deepseek, Kimi, and Qwen

★ 64,041 votes
💬 0
#9
GitHub

llama

Inference code for Llama models

★ 59,467 votes
💬 0
#10
GitHub

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

★ 58,804 votes
💬 0
#11
GitHub

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

★ 45,582 votes
💬 0
#12
GitHub

bert

TensorFlow code and pre-trained models for BERT

★ 40,030 votes
💬 0
#13
GitHub

quivr

Opinionated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore:...

★ 39,166 votes
💬 0
#14
GitHub

Langchain-Chatchat

Langchain-Chatchat (formerly Langchain-ChatGLM): RAG and Agent applications based on Langchain and language models such as ChatGLM, Qwen, and Llama | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and Ll...

★ 38,187 votes
💬 0
#15
GitHub

Retrieval-based-Voice-Conversion-WebUI

Easily train a good VC model with voice data <= 10 mins!

★ 36,052 votes
💬 0
#16
GitHub

khoj

Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI ...

★ 35,190 votes
💬 0
#17
GitHub

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

★ 33,878 votes
💬 0
#18
GitHub

pytorch-lightning

Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.

★ 31,191 votes
💬 0
#19
GitHub

llama3

The official Meta Llama 3 GitHub site

★ 29,287 votes
💬 0
#20
GitHub

awesome-generative-ai-guide

A one stop repository for generative AI research updates, interview resources, notebooks and much more!

★ 27,324 votes
💬 0
#21
GitHub

Qwen3

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

★ 27,310 votes
💬 0
#22
GitHub

Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

★ 21,307 votes
💬 0
#23
GitHub

rasa

💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants

★ 21,218 votes
💬 0
#24
GitHub

trl

Train transformer language models with reinforcement learning.

★ 18,665 votes
💬 0
#25
GitHub

llama-cookbook

Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama model f...

★ 18,365 votes
💬 0
#26
GitHub

DocsGPT

Private AI platform for agents, assistants and enterprise search. Built-in Agent Builder, Deep research, Document analysis, Multi-model support, and API connectivity for agents.

★ 17,941 votes
💬 0
#27
GitHub

Megatron-LM

Ongoing research training transformer models at scale

★ 16,748 votes
💬 0
#28
GitHub

Qwen-Agent

Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.

★ 16,577 votes
💬 0
#29
GitHub

tvm

Open Machine Learning Compiler Framework

★ 13,474 votes
💬 0
#30
GitHub

TensorRT

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

★ 13,080 votes
💬 0