Tools
826 open-source AI tools, models, frameworks, and agents — curated from awesome-opensource-ai
| Name | Section | Lang | License | Type | Description |
|---|---|---|---|---|---|
| 2FastLabs Agent Squad | Agentic AI & Multi-Agent Systems > Multi-Agent Orchestration | — | Apache-2.0 | AGENT | Flexible, lightweight open-source framework for orchestrating multiple AI agents to handle complex conversations with parallel execution capabilities. Apache 2.0 licensed |
| A2A Protocol | Agentic AI & Multi-Agent Systems > Multi-Agent Orchestration | — | Apache-2.0 | AGENT | Agent2Agent (A2A) open protocol enabling communication and interoperability between opaque agentic applications. Donated to Linux Foundation by Google with 50+ technology partners. Apache 2.0 licensed |
| ACE-Step 1.5 | Generative Media Tools > Audio / Music / Voice Generation | — | — | TOOL | Local-first music generation model with broad hardware support across Mac, AMD, Intel, and CUDA devices |
| Activepieces | Agentic AI & Multi-Agent Systems > Domain-Specific Agents | — | — | AGENT | Open-source automation platform with AI agents, MCP integrations, and self-hosted workflow orchestration |
| AdalFlow | Agentic AI & Multi-Agent Systems > Prompt Engineering & Structured Outputs | — | MIT | AGENT | Library to build and auto-optimize LLM applications with LLM-AutoDiff for fine-tuning-free optimization. End-to-end workflow optimization with tracing and human-in-the-loop capabilities. MIT licensed |
| Adversarial Robustness Toolbox (ART) | AI Safety, Alignment & Interpretability > Adversarial & Red-teaming Tools | — | MIT | TOOL | Python library for machine learning security supporting evasion, poisoning, extraction, and inference attacks. Most complete collection of adversarial attack and defense methods for deep learning. MIT licensed |
| Agency Swarm | Agentic AI & Multi-Agent Systems > Multi-Agent Orchestration | — | — | AGENT | Reliable multi-agent orchestration framework built on top of the OpenAI Assistants API with organizational structure modeling |
| Agent Chat UI | User Interfaces & Self-hosted Platforms > Agent & Voice Infrastructure | — | MIT | TOOL | Web app for interacting with any LangGraph agent (Python & TypeScript) via a chat interface. Stream messages, handle interruptions, and view agent state. MIT licensed |
| Agent Development Kit (Google) | Agentic AI & Multi-Agent Systems > Single-Agent Frameworks | — | Apache-2.0 | AGENT | Code-first Python toolkit for building sophisticated AI agents with multi-agent orchestration, built-in evaluation, and flexible deployment. Model-agnostic with tight Google ecosystem integration. Apache 2.0 licensed |
| Agent File | Agentic AI & Multi-Agent Systems > Agent Protocols & Standards | — | Apache-2.0 | AGENT | Open file format (.af) for serializing stateful AI agents with persistent memory and behavior. Share, checkpoint, and version control agents across compatible frameworks. Apache 2.0 licensed |
| Agent Squad (AWS Labs) | Agentic AI & Multi-Agent Systems > Multi-Agent Orchestration | — | Apache-2.0 | AGENT | Flexible multi-agent orchestration framework with intelligent intent classification and context management. Supports Python and TypeScript with pre-built agents for Bedrock, Lex, and custom integrations. Apache 2.0 licensed |
| Agent-S (Simular AI) | Agentic AI & Multi-Agent Systems > Domain-Specific Agents | — | — | AGENT | Open agentic framework that uses computers like a human. SOTA on OSWorld benchmark (72.6%) for GUI automation and computer control |
| Agenta | MLOps / LLMOps & Production > Monitoring, Evaluation & Observability | — | — | TOOL | Open-source LLMOps platform combining prompt playground, prompt management, LLM evaluation, and observability |
| AgentBench (THUDM) | Evaluation, Benchmarks & Datasets > Benchmark Suites | — | Apache-2.0 | BENCHMARK | Comprehensive benchmark to evaluate LLMs as agents across 8 diverse environments including household, web shopping, OS interaction, and database tasks. ICLR 2024. Apache 2.0 licensed |
| Agentic Security | AI Safety, Alignment & Interpretability > Adversarial & Red-teaming Tools | — | Apache-2.0 | TOOL | Agentic LLM vulnerability scanner and AI red teaming kit with multi-step attack simulation and automated security probing. Apache 2.0 licensed |
| AgentOps | AI Safety, Alignment & Interpretability > Safety Evaluation Frameworks | — | MIT | TOOL | Python SDK for AI agent monitoring, LLM cost tracking, benchmarking, and evaluation. Integrates with CrewAI, Agno, OpenAI Agents SDK, LangChain, Autogen, AG2, and CamelAI. MIT licensed |
| AgentScope | Agentic AI & Multi-Agent Systems > Multi-Agent Orchestration | — | — | AGENT | Alibaba's production-ready multi-agent framework with 23K+ stars. Features built-in MCP and A2A support, message hub for flexible orchestration, and AgentScope Runtime for production deployment |
| Agno | Agentic AI & Multi-Agent Systems > Single-Agent Frameworks | — | — | AGENT | Build, run, and manage agentic software at scale. High-performance framework for multi-agent systems with memory, knowledge, and tools |
| AI Engineering Hub | Resources & Learning > Courses & Interactive Playgrounds | — | MIT | TOOL | 93+ production-ready projects with in-depth tutorials on LLMs, RAG, and real-world AI agent applications. Comprehensive resources for all skill levels from beginner to advanced. MIT licensed |
| AI Fairness 360 | AI Safety, Alignment & Interpretability > Fairness & Bias Mitigation | — | — | TOOL | Comprehensive toolkit for detecting, understanding, and mitigating unwanted algorithmic bias in datasets and ML models |
| AI For Beginners (Microsoft) | Resources & Learning > Courses & Interactive Playgrounds | — | — | TOOL | 12-week, 24-lesson curriculum on Artificial Intelligence. Covers symbolic AI, neural networks, computer vision, NLP, and reinforcement learning with hands-on labs |
| AI Town | Agentic AI & Multi-Agent Systems > Multi-Agent Orchestration | — | MIT | AGENT | Deployable starter kit for building virtual towns where AI characters live, chat and socialize. Inspired by Stanford's Generative Agents research with persistent agent memory and social interactions. MIT licensed |
| AI-Infra-Guard (Tencent) | AI Safety, Alignment & Interpretability > Adversarial & Red-teaming Tools | — | Apache-2.0 | TOOL | Full-stack AI Red Teaming platform securing AI ecosystems via OpenClaw Security Scan, Agent Scan, Skills Scan, MCP scan, AI Infra scan and LLM jailbreak evaluation. Apache 2.0 licensed |
| AI-Scientist-v2 (SakanaAI) | Agentic AI & Multi-Agent Systems > Domain-Specific Agents | — | — | AGENT | Workshop-level automated scientific discovery via agentic tree search. Generates novel research ideas, runs experiments, and writes papers |
| AI-Toolkit | Training & Fine-tuning Ecosystem > Full Training Frameworks | — | MIT | TOOL | Ultimate training toolkit for finetuning diffusion models. Easy-to-use all-in-one training suite supporting FLUX.1, FLUX.2, Stable Diffusion, and video models with both GUI and CLI interfaces. Consumer-grade hardware friendly with comprehensive LoRA and full fine-tuning support. MIT licensed |
| AIBrix | Inference Engines & Serving > High-performance Serving & API Servers | — | Apache-2.0 | TOOL | Cost-efficient and pluggable infrastructure components for GenAI inference. Kubernetes-native control plane for vLLM with distributed KV cache, heterogeneous GPU serving, and intelligent routing. Apache 2.0 licensed |
| AIChat | Developer Tools & Integrations > CLI Tools & API Clients | — | MIT OR Apache-2.0 | TOOL | All-in-one LLM CLI in Rust featuring Shell Assistant, Chat-REPL, RAG, AI Tools & Agents. Supports 20+ providers. Dual-licensed under MIT or Apache 2.0 |
| aicommits | Developer Tools & Integrations > CLI Tools & API Clients | — | MIT | TOOL | CLI that writes your Git commit messages for you with AI. Never write a commit message again. Supports multiple providers including OpenAI, Groq, xAI, Ollama, and LM Studio. MIT licensed |
| Aider | Developer Tools & Integrations > AI Coding Assistants (open-source) | — | Apache-2.0 | TOOL | Terminal-based AI pair programmer. Edit code in your local editor and aider implements the changes. Supports multiple LLMs, voice coding, and automatic Git commits. Top scores on SWE Bench. Apache 2.0 licensed |
| Aider | Agentic AI & Multi-Agent Systems > Autonomous Coding Agents | — | — | AGENT | Command-line pair-programming agent |
| Aider Desk | Developer Tools & Integrations > AI-Native IDEs & Development Environments | — | Apache-2.0 | TOOL | Platform for AI-powered software engineers. Desktop application that enhances the aider terminal experience with a modern UI. Apache 2.0 licensed |
| Aim | MLOps / LLMOps & Production > Experiment Tracking & Versioning | — | Apache-2.0 | TOOL | Self-hosted ML experiment tracker designed to handle tens of thousands of training runs, with a performant UI and an SDK for programmatic access. Apache 2.0 licensed |
| aisuite | Retrieval-Augmented Generation (RAG) & Knowledge > LLM Application Frameworks | — | MIT | TOOL | Simple, unified interface to multiple Generative AI providers. Use OpenAI, Anthropic, Google, and 10+ other providers with a standardized API similar to OpenAI's. Switch between models or providers with a single line of code. MIT licensed |
| AIX360 | AI Safety, Alignment & Interpretability > Interpretability & Explainability | — | Apache-2.0 | TOOL | Comprehensive AI explainability toolkit with interpretability algorithms for data and machine learning models. Includes TED, BRCG, and ProtoNN methods for diverse explanation needs. Apache 2.0 licensed |
| align-anything | Training & Fine-tuning Ecosystem > Full Training Frameworks | — | Apache-2.0 | TOOL | Training all-modality models with feedback. Supports RLHF, DPO, and alignment fine-tuning for text, image, audio, and video models with seamless Slurm cluster integration. Apache 2.0 licensed |
| Alignment Handbook | AI Safety, Alignment & Interpretability > Alignment & RLHF Tools | — | — | TOOL | Complete recipes for full-stack alignment |
| Amphion | Generative Media Tools > Audio / Music / Voice Generation | — | — | TOOL | Comprehensive toolkit for Audio, Music, and Speech Generation (9.7K stars) |
| Amundsen | Core Frameworks & Libraries > Data Engineering & Feature Stores | — | Apache-2.0 | TOOL | Data discovery and metadata engine from Lyft. PageRank-style search for data resources with usage-based ranking. LF AI & Data Foundation project. Apache 2.0 licensed |
| Andrej Karpathy Skills | Resources & Learning > Curated Resource Lists | — | MIT | TOOL | A single CLAUDE.md file to improve Claude Code behavior, derived from Andrej Karpathy's observations on LLM coding pitfalls. Principles: Think Before Coding, Simplicity First, Surgical Changes, Goal-Driven Execution. MIT licensed |
| AnythingLLM | User Interfaces & Self-hosted Platforms > Full Self-hosted AI Platforms | — | — | TOOL | All-in-one RAG + agents platform |
| Apache Airflow | Core Frameworks & Libraries > Data Processing & Manipulation | — | Apache-2.0 | TOOL | Platform to programmatically author, schedule, and monitor workflows. Industry-standard orchestration for data pipelines and ML workflows with 500+ integrations. Apache 2.0 licensed |
| Apache Beam | Core Frameworks & Libraries > Data Processing & Manipulation | — | Apache-2.0 | TOOL | Unified programming model for batch and streaming data processing. Write pipelines once, run anywhere on Flink, Spark, or Google Cloud Dataflow. Portable, extensible, and enterprise-ready for AI data pipelines. Apache 2.0 licensed |
| Apache Flink | Core Frameworks & Libraries > Data Processing & Manipulation | — | Apache-2.0 | TOOL | Stream processing framework with powerful batch and streaming capabilities. High-throughput, low-latency runtime with exactly-once processing guarantees. Ideal for real-time AI inference pipelines and event-driven ML applications. Apache 2.0 licensed |
| Apache Hudi | Core Frameworks & Libraries > Data Processing & Manipulation | — | Apache-2.0 | TOOL | Open data lakehouse platform for ingesting, indexing, storing, serving, transforming and managing data across cloud environments. Supports upserts, deletes and incremental processing on big data with built-in ingestion tools for Spark and Flink. Apache 2.0 licensed |
| Apache Iceberg | Core Frameworks & Libraries > Data Processing & Manipulation | — | Apache-2.0 | TOOL | High-performance open table format for huge analytic tables. Brings SQL table reliability to big data with time travel, hidden partitioning, and schema evolution. Works with Spark, Trino, Flink, Presto, Hive and Impala. Apache 2.0 licensed |
| Apache Solr | Retrieval-Augmented Generation (RAG) & Knowledge > Vector Databases & Search Engines | — | — | TOOL | Mature Lucene-based search platform with dense vector search, filtering, faceting, and hybrid retrieval patterns for production search-heavy RAG systems |
| Apache Spark | Core Frameworks & Libraries > Data Processing & Manipulation | — | Apache-2.0 | TOOL | Unified analytics engine for large-scale data processing. In-memory cluster computing with high-level APIs in Python, Scala, Java, and R. Powers MLlib for distributed machine learning and Structured Streaming for real-time data. Apache 2.0 licensed |
| Apache TVM | Specialized Domains > Edge / On-device AI | — | Apache-2.0 | TOOL | Open Machine Learning Compiler Framework. Universal deployment to bring models into minimum deployable modules that can be embedded and run everywhere from datacenter to edge devices. Apache 2.0 licensed |
| Apache YuniKorn | MLOps / LLMOps & Production > Deployment & Orchestration | — | Apache-2.0 | TOOL | Kubernetes resource scheduler for batch, data, and ML workloads. Provides hierarchical resource queues, multi-tenancy fairness, and gang scheduling for big data and machine learning applications. Apache 2.0 licensed |
| Aphrodite Engine | Inference Engines & Serving > High-performance Serving & API Servers | — | — | TOOL | vLLM fork optimized for role-play and creative writing. Supports extensive quantization methods (AQLM, AWQ, GPTQ, GGUF, FP8) and modern samplers. Active development with multi-LoRA and speculative decoding support |
| Archon | Agentic AI & Multi-Agent Systems > Autonomous Coding Agents | — | MIT | AGENT | Workflow engine for deterministic AI coding agents. Define development processes as YAML workflows (planning → implementation → validation → review → PR) with isolated Git worktrees for parallel execution. MIT licensed |
| Argilla | Training & Fine-tuning Ecosystem > Synthetic Data Generation | — | — | TOOL | Open-source data labeling + synthetic data platform |
| Argo Workflows | MLOps / LLMOps & Production > Deployment & Orchestration | — | Apache-2.0 | TOOL | CNCF graduated container-native workflow engine for orchestrating parallel jobs on Kubernetes. Powers Kubeflow Pipelines and widely used for ML/data processing at scale. Apache 2.0 licensed |
| ArviZ | Specialized Domains > Probabilistic Programming & Bayesian ML | — | Apache-2.0 | TOOL | Exploratory analysis of Bayesian models with Python. Comprehensive visualization and diagnostics for probabilistic models, supporting PyMC, Pyro, Stan, and other PPLs. Apache 2.0 licensed |
| Assistant UI | Developer Tools & Integrations > UI Components & Chat Libraries | — | — | TOOL | React/TypeScript library for building production-grade AI chat interfaces. Drop-in components for streaming messages, tool calls, and multi-modal inputs |
| Astropy | Specialized Domains > Scientific AI & Physics ML | — | BSD-3-Clause | TOOL | Core library for astronomy and astrophysics in Python. Comprehensive tools for celestial coordinates, FITS I/O, cosmological calculations, and data analysis for professional astronomy. BSD-3-Clause licensed |
| AutoFlow | Retrieval-Augmented Generation (RAG) & Knowledge > RAG Frameworks & Advanced Retrieval Tools | — | Apache-2.0 | TOOL | Graph RAG-based conversational knowledge base tool built on TiDB Vector and LlamaIndex. Features Perplexity-style search with built-in website crawler. Apache 2.0 licensed |
| AutoGen (AG2) | Agentic AI & Multi-Agent Systems > Single-Agent Frameworks | — | — | AGENT | Flexible multi-agent conversation framework |
| AutoGluon | Core Frameworks & Libraries > AutoML & Hyperparameter Optimization | — | — | TOOL | AWS AutoML toolkit for tabular, image, text, and multimodal data - state-of-the-art with almost zero code |
| AutoGPT | Agentic AI & Multi-Agent Systems > Single-Agent Frameworks | — | — | AGENT | The original autonomous AI agent framework that sparked the agent revolution. Vision of accessible AI for everyone with modular agent architecture, benchmark testing, and forge-based agent building. 183k+ stars |
| AutoKeras | Core Frameworks & Libraries > AutoML & Hyperparameter Optimization | — | — | TOOL | Neural architecture search on top of Keras |
| AutoPrompt | Agentic AI & Multi-Agent Systems > Prompt Engineering & Structured Outputs | — | Apache-2.0 | AGENT | Intent-based prompt calibration framework that iteratively optimizes prompts through automated edge case generation and refinement. Reduces manual prompt engineering effort while addressing prompt sensitivity and ambiguity. Apache 2.0 licensed |
| AutoRAG | Evaluation, Benchmarks & Datasets > Evaluation Frameworks | — | Apache-2.0 | TOOL | RAG AutoML tool for automatically finding optimal RAG pipelines. Evaluates and optimizes retrieval-augmented generation with AutoML-style automation for your own data and use-case. Apache 2.0 licensed |
| AutoTS | Specialized Domains > Time Series & Scientific AI | — | — | TOOL | Automated time series forecasting with broad model selection, ensembling, anomaly detection, and holiday effects. Designed for production deployment with minimal setup |
| Autoware | Specialized Domains > Autonomous Driving & Robotics Simulators | — | Apache-2.0 | TOOL | World's leading open-source software project for autonomous driving. Complete stack from localization and object detection to route planning and control. Used by 50+ companies globally. Apache 2.0 licensed |
| avante.nvim | Developer Tools & Integrations > IDE Plugins & Extensions | — | Apache-2.0 | TOOL | Neovim plugin that brings Cursor-like AI IDE features to Vim. Edit code with natural language, generate code from context, and chat with AI about your codebase. Apache 2.0 licensed |
| Awesome Machine Learning | Resources & Learning > Curated Resource Lists | — | CC0-1.0 | TOOL | The definitive curated list of machine learning frameworks, libraries and software organized by language. Covers Python, C++, Java, JavaScript, and more with comprehensive coverage of the ML ecosystem. CC0-1.0 licensed |
| Ax | Retrieval-Augmented Generation (RAG) & Knowledge > LLM Application Frameworks | — | Apache-2.0 | TOOL | TypeScript framework for building reliable AI applications. "Official" DSPy-inspired framework for TypeScript with type-safe LLM interactions, chain-of-thought reasoning, and structured output validation. Apache 2.0 licensed |
| Axolotl | Training & Fine-tuning Ecosystem > Full Training Frameworks | — | — | TOOL | YAML-driven full pipeline for SFT, DPO, GRPO |
| BabyAGI | Agentic AI & Multi-Agent Systems > Single-Agent Frameworks | — | — | AGENT | Pioneering task-driven autonomous agent that inspired the AI agent movement. Simple, elegant implementation of an AI agent that creates, prioritizes, and executes tasks autonomously. 22k+ stars |
| BeeAI Framework (IBM) | Agentic AI & Multi-Agent Systems > Multi-Agent Orchestration | — | — | AGENT | Production-ready multi-agent framework in Python and TypeScript. Features workflow orchestration, ACP/MCP protocol support, and deep watsonx integration. Part of Linux Foundation AI & Data program |
| BentoML | MLOps / LLMOps & Production > Deployment & Orchestration | — | — | TOOL | Unified framework to build, ship, and scale AI apps |
| Bespoke Curator | Training & Fine-tuning Ecosystem > Synthetic Data Generation | — | Apache-2.0 | TOOL | Synthetic data curation for post-training and structured data extraction. Makes it easy to build pipelines around LLMs with batching and progress tracking. Apache 2.0 licensed |
| BGE (FlagEmbedding) | Retrieval-Augmented Generation (RAG) & Knowledge > Embedding Models | — | — | TOOL | BAAI's best-in-class embedding family |
| big-AGI | User Interfaces & Self-hosted Platforms > Local AI Chat UIs & Personal Assistants | — | MIT | TOOL | AI suite for power users with multi-model "Beam" chats, AI personas, voice, text-to-image, code execution, and PDF import. MIT licensed |
| biniou | User Interfaces & Self-hosted Platforms > Full Self-hosted AI Platforms | — | GPL-3.0 | TOOL | Self-hosted webUI for 30+ generative AI models. Generate multimedia content with AI on your own computer, even without dedicated GPU (8GB RAM minimum). Works offline once deployed. GPL-3.0 licensed |
| BionicGPT | User Interfaces & Self-hosted Platforms > Local AI Chat UIs & Personal Assistants | — | — | TOOL | On-prem ChatGPT replacement for teams with assistants, RAG, access controls, auditing, and enterprise deployment features |
| bitsandbytes | Inference Engines & Serving > Quantization, Distillation & Optimization | — | — | TOOL | 8-bit and 4-bit optimizers + quantization |
| Bloom | AI Safety, Alignment & Interpretability > Safety Evaluation Frameworks | — | MIT | TOOL | Open-source agentic framework for automated behavioral evaluations of frontier AI models. Generates targeted evaluation suites to probe LLMs for specific behaviors (sycophancy, self-preservation, political bias, etc.) with quantitative elicitation rates. From Anthropic's safety research team. MIT licensed |
| Boltz | Specialized Domains > Scientific AI & Drug Discovery | — | MIT | TOOL | Open-source biomolecular interaction prediction models. Boltz-1 was the first fully open source model to approach AlphaFold3 accuracy; Boltz-2 adds binding affinity prediction for drug discovery. MIT licensed |
| Browser Use | Agentic AI & Multi-Agent Systems > Domain-Specific Agents | — | MIT | AGENT | Makes websites accessible for AI agents. Enables autonomous web automation, data extraction, and task completion with natural language instructions. MIT licensed |
| BrowserGym | Evaluation, Benchmarks & Datasets > Evaluation Frameworks | — | Apache-2.0 | TOOL | Gym environment for web task automation and agent evaluation. Includes MiniWoB, WebArena, WorkArena, and more. Apache 2.0 licensed |
| Burn | Core Frameworks & Libraries > Rust ML Frameworks | Rust | — | TOOL | Next-generation deep learning framework in Rust. Backend-agnostic with CPU, GPU, WebAssembly support |
| Burr | Agentic AI & Multi-Agent Systems > Single-Agent Frameworks | — | Apache-2.0 | AGENT | Apache incubating framework for building stateful AI applications (chatbots, agents, simulations). Monitor, trace, persist, and execute on your own infrastructure with built-in UI and pluggable memory. Apache 2.0 licensed |
| CAI | AI Safety, Alignment & Interpretability > Adversarial & Red-teaming Tools | — | MIT | TOOL | Cybersecurity AI framework for semi- and fully-automating offensive and defensive security tasks. Purpose-built for cybersecurity use cases with agent-based architecture for vulnerability assessment and security operations. MIT licensed |
| CAMEL | Agentic AI & Multi-Agent Systems > Multi-Agent Orchestration | — | Apache-2.0 | AGENT | Multi-agent framework for building scalable agent systems, billed as the first of its kind. Extensive tooling for agent communication and task automation. Apache 2.0 licensed |
| Candle (Hugging Face) | Core Frameworks & Libraries > Rust ML Frameworks | Rust | — | TOOL | Minimalist ML framework for Rust. PyTorch-like API with focus on performance and simplicity |
| Captum | AI Safety, Alignment & Interpretability > Interpretability & Explainability | — | — | TOOL | PyTorch's official interpretability library |
| CARLA | Specialized Domains > Autonomous Driving & Robotics Simulators | — | MIT | TOOL | Open-source simulator for autonomous driving research. High-fidelity simulation of urban environments with realistic physics, sensors, and traffic scenarios. Widely used for training and validating self-driving algorithms. MIT licensed |
| Casibase | User Interfaces & Self-hosted Platforms > Local AI Chat UIs & Personal Assistants | — | Apache-2.0 | TOOL | Open-source enterprise-level AI knowledge base and agent management platform. Supports multiple LLM providers, RAG, and team collaboration. Apache-2.0 licensed |
| CatBoost | Core Frameworks & Libraries > Classical ML & Gradient Boosting | — | — | TOOL | Gradient boosting that handles categorical features natively with great out-of-the-box performance |
| Cerebras Model Zoo | MLOps / LLMOps & Production > Model Hubs & Registries | — | Apache-2.0 | TOOL | Collection of deep learning models and utilities optimized for Cerebras hardware. Includes reference implementations for Llama, Mixtral, DINOv2, and Llava with configuration files, data preprocessing tools, and checkpoint converters. 1,150+ stars. Apache 2.0 licensed |
| ChainForge | MLOps / LLMOps & Production > Monitoring, Evaluation & Observability | — | MIT | TOOL | Visual programming environment for battle-testing prompts and evaluating LLM outputs. Features node-based prompt chains, multi-model comparison, and hypothesis testing. MIT licensed |
| ChatALL | User Interfaces & Self-hosted Platforms > Desktop & Mobile AI Apps | — | Apache-2.0 | TOOL | Concurrently chat with multiple AI bots to discover the best answers. Desktop app for comparing ChatGPT, Claude, Gemini, and 20+ LLMs side-by-side. Apache 2.0 licensed |
| Chatbox | User Interfaces & Self-hosted Platforms > Desktop & Mobile AI Apps | — | GPL-3.0 | TOOL | Powerful desktop AI client for ChatGPT, Claude, and other LLMs. Cross-platform with modern UI. GPLv3 licensed (Community Edition) |
| ChatDev | Agentic AI & Multi-Agent Systems > Multi-Agent Orchestration | — | Apache-2.0 | AGENT | Multi-agent software development framework where AI agents collaborate as programmers, designers, and testers to build software. Apache 2.0 licensed |
| Chatterbox (Resemble AI) | Open Foundation Models > Speech & Audio Models (TTS, STT, Music) | — | — | MODEL | State-of-the-art open TTS family with 350M parameter Turbo variant. Single-step generation with native paralinguistic tags for realistic dialogue |
| ChatTTS | Open Foundation Models > Speech & Audio Models (TTS, STT, Music) | — | AGPL-3.0 | MODEL | Generative speech model optimized for daily dialogue. Natural, expressive conversational speech synthesis with fine-grained prosody control. AGPL-3.0 licensed |
| Cherry Studio | User Interfaces & Self-hosted Platforms > Desktop & Mobile AI Apps | — | AGPL-3.0 | TOOL | AI productivity studio with smart chat, autonomous agents, and 300+ assistants. Unified access to frontier LLMs. AGPL-3.0 licensed |
| Chonkie | Retrieval-Augmented Generation (RAG) & Knowledge > RAG Frameworks & Advanced Retrieval Tools | — | MIT | TOOL | Lightweight document chunking library for fast, efficient RAG pipelines. Memory-safe with multiple chunking strategies (semantic, token, recursive) and direct vector DB integration. MIT licensed |
| Chroma | Retrieval-Augmented Generation (RAG) & Knowledge > Vector Databases & Search Engines | — | — | TOOL | Most popular open-source embedding database |
| Chronos (Amazon) | Specialized Domains > Time Series & Scientific AI | — | — | TOOL | Pretrained foundation models for time-series forecasting |
| Civitai | MLOps / LLMOps & Production > Model Hubs & Registries | — | Apache-2.0 | TOOL | Open-source AI model hub and community platform for sharing and discovering generative AI models, with focus on image generation models. Features model versioning, reviews, and integrated inference. Apache 2.0 licensed |
| Claude Squad | Developer Tools & Integrations > CLI Tools & API Clients | — | AGPL-3.0 | TOOL | Manage multiple AI terminal agents like Claude Code, Codex, OpenCode, and Amp. Terminal multiplexer for AI coding agents with session management and parallel execution. AGPL-3.0 licensed |
| Cleanlab | Evaluation, Benchmarks & Datasets > High-quality Open Datasets & Data Tools | — | Apache-2.0 | TOOL | Data-centric AI package for automatically finding and fixing issues in datasets. Detects label errors, outliers, and ambiguous examples in ML datasets. Apache 2.0 licensed |
| ClearML | MLOps / LLMOps & Production > Experiment Tracking & Versioning | — | — | TOOL | Open-source platform for experiment tracking, orchestration, data management, and model serving |
| Cline | Developer Tools & Integrations > AI Coding Assistants (open-source) | — | — | TOOL | Open-source IDE coding agent that can edit files, run commands, and use tools with user approval |
| CoAI | User Interfaces & Self-hosted Platforms > Full Self-hosted AI Platforms | — | Apache-2.0 | TOOL | Next-generation multi-tenant AI one-stop solution with built-in admin and billing system. Enterprise-grade unified LLM gateway supporting 200+ models and 35+ providers. Apache-2.0 licensed |
| Code Server | Developer Tools & Integrations > AI-Native IDEs & Development Environments | — | MIT | TOOL | Run VS Code on any machine anywhere and access it in the browser. Self-hosted cloud IDE with full extension support. MIT licensed |
| CodeCompanion.nvim | Developer Tools & Integrations > IDE Plugins & Extensions | — | — | TOOL | AI-powered coding assistant for Neovim. Inline code generation, chat, actions, and tool use with support for multiple LLM providers |
| Codex CLI | Developer Tools & Integrations > CLI Tools & API Clients | — | Apache-2.0 | TOOL | OpenAI's lightweight coding agent that runs in your terminal. Code generation, file editing, and command execution with approval. Apache 2.0 licensed |
| Cog (Replicate) | MLOps / LLMOps & Production > Model Packaging & Deployment | — | Apache-2.0 | TOOL | Containerize and deploy ML models with production-grade inference servers. Packages models into standardized containers with automatic API generation, GPU support, and one-command deployment. Powers thousands of production AI models on Replicate. Apache 2.0 licensed |
| Colossal-AI | Training & Fine-tuning Ecosystem > Distributed Training | — | — | TOOL | Unified system for 100B+ models |
| ColPali / ColQwen | Retrieval-Augmented Generation (RAG) & Knowledge > RAG Frameworks & Advanced Retrieval Tools | — | — | TOOL | Vision-language models for document retrieval |
| ComfyUI | Generative Media Tools > Image Generation & Editing | — | — | TOOL | Node-based visual workflow editor for Stable Diffusion, FLUX, etc |
| Complete Agentic AI Engineering Course | Resources & Learning > Courses & Interactive Playgrounds | — | MIT | TOOL | 6-week comprehensive course on Agentic AI covering autonomous agents, multi-agent systems, and practical agent development. MIT licensed |
| Composer | Core Frameworks & Libraries > Model Training & Optimization Utilities | — | Apache-2.0 | TOOL | Supercharge your model training. MosaicML's PyTorch training library with built-in algorithms for efficient training (FSDP, gradient compression, progressive resizing) and seamless distributed training on large-scale clusters. Apache 2.0 licensed |
| Composio | Agentic AI & Multi-Agent Systems > Domain-Specific Agents | — | — | AGENT | Tool integration layer for AI agents with 1000+ toolkits, authentication management, and sandboxed workbench. Powers tool use across major frameworks |
| Composio Agent Orchestrator | Agentic AI & Multi-Agent Systems > Multi-Agent Orchestration | — | MIT | AGENT | Agentic orchestrator for parallel coding agents. Plans tasks, spawns agents, and autonomously handles CI fixes, merge conflicts, and code reviews. MIT licensed |
| Conductor OSS | Agentic AI & Multi-Agent Systems > Multi-Agent Orchestration | — | Apache-2.0 | AGENT | Event-driven agentic orchestration platform providing durable and resilient execution engine for applications and AI agents. Battle-tested at Netflix, Tesla, LinkedIn, and J.P. Morgan with 30K+ stars. Apache 2.0 licensed |
| Context7 | Developer Tools & Integrations > CLI Tools & API Clients | — | MIT | TOOL | Up-to-date code documentation for LLMs and AI code editors. Fetches latest docs and code examples directly into LLM context via MCP. Eliminates hallucinated APIs. MIT licensed |
| ContextGem | Retrieval-Augmented Generation (RAG) & Knowledge > LLM Application Frameworks | — | Apache-2.0 | TOOL | Effortless LLM extraction framework for documents. Powerful abstractions for building extraction workflows with automated dynamic prompts, data modeling, validation, and precise reference mapping. Apache 2.0 licensed |
| Continue | Developer Tools & Integrations > AI Coding Assistants (open-source) | — | — | TOOL | Open-source AI coding autopilot for VS Code & JetBrains |
| CoPaw | User Interfaces & Self-hosted Platforms > Local AI Chat UIs & Personal Assistants | — | Apache-2.0 | TOOL | Your Personal AI Assistant; easy to install, deploy on your own machine or on the cloud; supports multiple chat apps with easily extensible capabilities. Apache-2.0 licensed |
| CopilotKit | Developer Tools & Integrations > UI Components & Chat Libraries | — | MIT | TOOL | Best-in-class SDK for building full-stack agentic applications, Generative UI, and chat applications. Creators of the AG-UI Protocol adopted by Google, LangChain, AWS, and Microsoft. MIT licensed |
| CosyVoice | Open Foundation Models > Speech & Audio Models (TTS, STT, Music) | — | Apache-2.0 | MODEL | Multi-lingual large voice generation model with full-stack inference, training and deployment capabilities. Supports cross-lingual voice cloning and emotional expression control. Apache 2.0 licensed |
| Crawl4AI | Retrieval-Augmented Generation (RAG) & Knowledge > Web Data Ingestion | — | — | TOOL | LLM-friendly web crawler that turns websites into clean Markdown for RAG and agentic workflows |
| CrewAI | Agentic AI & Multi-Agent Systems > Single-Agent Frameworks | — | — | AGENT | Role-based agent framework |
| CTranslate2 | Inference Engines & Serving > Additional Inference Engines | — | MIT | TOOL | Fast inference engine for Transformer models supporting OpenNMT and Hugging Face models. Optimized for CPU and GPU with batching, quantization (INT8/FP16), and dynamic memory management. Powers faster-whisper and other production deployments. MIT licensed |
| cuDF | Core Frameworks & LibrariesData Processing & Manipulation | — | — | TOOL | GPU DataFrame library from RAPIDS. Accelerates Pandas workflows on NVIDIA GPUs with zero code changes using the cudf.pandas accelerator mode |
| cuGraph | Core Frameworks & LibrariesData Processing & Manipulation | — | Apache-2.0 | TOOL | GPU graph analytics library with NetworkX-compatible API. 10-100x faster than CPU for large-scale graph algorithms. Apache 2.0 licensed |
| cuML | Core Frameworks & LibrariesClassical ML & Gradient Boosting | — | Apache-2.0 | TOOL | GPU-accelerated machine learning algorithms with scikit-learn compatible API. 10-50x faster than CPU implementations for large datasets. Apache 2.0 licensed |
| CVAT | Specialized DomainsComputer Vision | — | — | TOOL | Industry-leading data annotation platform for computer vision. Interactive video and image annotation tool used by tens of thousands of teams for machine learning at any scale |
| D-Tale | Core Frameworks & LibrariesData Processing & Manipulation | — | LGPL-2.1 | TOOL | Visualizer for Pandas data structures with a Flask back-end and React front-end. Interactive data exploration with charting, filtering, and code export. LGPL-2.1 licensed |
| Dagster | MLOps / LLMOps & ProductionDeployment & Orchestration | — | Apache-2.0 | TOOL | Cloud-native orchestration platform for developing and maintaining data assets including ML models. Declarative programming model with integrated lineage and observability. Apache 2.0 licensed |
| Darts | Core Frameworks & LibrariesClassical ML & Gradient Boosting | — | Apache-2.0 | TOOL | User-friendly forecasting and anomaly detection for time series. Unifies classical statistical models (ARIMA, ETS) with modern neural networks (N-BEATS, TFT, DeepAR) in a single scikit-learn compatible API. Apache 2.0 licensed |
| Dask | Core Frameworks & LibrariesData Processing & Manipulation | — | — | TOOL | Parallel computing for big data - scales Pandas/NumPy/scikit-learn to clusters |
| Data Science for Beginners (Microsoft) | Resources & LearningCourses & Interactive Playgrounds | — | — | TOOL | 10-week, 20-lesson curriculum on data science fundamentals. Covers data preparation, visualization, modeling, and deployment with practical projects |
| Data-Juicer | Training & Fine-tuning EcosystemSynthetic Data Generation | — | — | TOOL | High-performance data processing for LLM training |
| DataHub | Core Frameworks & LibrariesData Engineering & Feature Stores | — | Apache-2.0 | TOOL | The #1 open-source metadata platform for data and AI. Data discovery, governance, and observability with 80+ connectors, column-level lineage, and AI assistant integration. Originally built at LinkedIn. Apache 2.0 licensed |
| Datashader | Core Frameworks & LibrariesData Processing & Manipulation | — | BSD-3-Clause | TOOL | High-performance large data visualization. Renders billions of points interactively without aggregation artifacts. BSD-3-Clause licensed |
| DataTrove (Hugging Face) | Training & Fine-tuning EcosystemSynthetic Data Generation | — | — | TOOL | Platform-agnostic data processing pipelines for LLM training at scale. Handles filtering, deduplication, and tokenization on local machines or SLURM clusters |
| Daytona | Developer Tools & IntegrationsAI-Native IDEs & Development Environments | — | AGPL-3.0 | TOOL | Secure elastic infrastructure for running AI-generated code. Self-hosted alternative to GitHub Codespaces with support for multiple IDEs, prebuilds, and any cloud provider. AGPL-3.0 licensed |
| DB-GPT | Retrieval-Augmented Generation (RAG) & KnowledgeRAG Frameworks & Advanced Retrieval Tools | — | — | TOOL | Self-hosted AI data assistant for private knowledge, database-aware conversations, and data-heavy RAG workflows |
| dbt-core | Core Frameworks & LibrariesData Transformation & Analytics Engineering | — | Apache-2.0 | TOOL | Transform data using software engineering best practices. The industry-standard framework for analytics engineering with 15M+ monthly downloads. Enables version control, testing, and documentation for SQL transformations. Apache 2.0 licensed |
| Deep Chat | Developer Tools & IntegrationsUI Components & Chat Libraries | — | MIT | TOOL | Fully customizable AI chatbot component for your website. Supports OpenAI, direct API services, and custom endpoints. MIT licensed |
| Deep Lake | Retrieval-Augmented Generation (RAG) & KnowledgeVector Databases & Search Engines | — | Apache-2.0 | TOOL | AI Data Runtime for Agents with serverless PostgreSQL and multimodal datalake. Store and search vectors, images, text, videos, and more with LangChain/LlamaIndex integrations. Used by Intel, Bayer, Yale, and Oxford. Apache 2.0 licensed |
| Deep RL Class (Hugging Face) | Resources & LearningCourses & Interactive Playgrounds | — | — | TOOL | Free deep reinforcement learning course with hands-on exercises and trained agent publishing to the Hugging Face Hub |
| Deep-Live-Cam | Generative Media ToolsFace Swap & Deepfake | — | AGPL-3.0 | TOOL | Real-time face swap and one-click video deepfake with only a single image. High-quality face swapping for live video streaming and content creation. AGPL-3.0 licensed |
| DeepChat | User Interfaces & Self-hosted PlatformsDesktop & Mobile AI Apps | — | Apache-2.0 | TOOL | A smart assistant that connects powerful AI to your personal world. Built-in MCP and ACP support, multiple search engines, privacy-focused with local data storage. Apache-2.0 licensed |
| Deepchecks | MLOps / LLMOps & ProductionMonitoring, Evaluation & Observability | — | — | TOOL | Holistic validation and testing suite for ML models and data. Continuous validation from research to production with 50+ built-in checks for data integrity, distribution drift, and model performance |
| DeepChem | Specialized DomainsScientific AI & Drug Discovery | — | MIT | TOOL | Democratizing deep learning for drug discovery, quantum chemistry, materials science, and biology. High-quality open-source toolchain with 50+ models and extensive tutorials. MIT licensed |
| DeepCode | Developer Tools & IntegrationsAI Coding Assistants (open-source) | — | MIT | TOOL | Transforms research papers and natural language into production-ready code. AI-powered research-to-code automation tool. MIT licensed |
| DeepEval | Evaluation, Benchmarks & DatasetsEvaluation Frameworks | — | — | TOOL | The "Pytest for LLMs" |
| Deepnote | Developer Tools & IntegrationsNotebooks & Interactive Computing | — | Apache-2.0 | TOOL | Drop-in replacement for Jupyter with AI-first design, sleek UI, and native data integrations. Use Python, R, and SQL locally, then scale to Deepnote cloud for collaboration and deployable data apps. Apache 2.0 licensed |
| DeepResearchAgent | Agentic AI & Multi-Agent SystemsMulti-Agent Orchestration | — | — | AGENT | Hierarchical multi-agent system for deep research tasks with automated task decomposition and execution across complex domains |
| DeepSeek-Coder-V2 / R1-Coder | Open Foundation ModelsCoding & Reasoning Models | — | — | MODEL | Best-in-class open coding model (236B MoE). Outperforms closed models on many code benchmarks |
| DeepSpeed | Core Frameworks & LibrariesModel Training & Optimization Utilities | — | — | TOOL | Microsoft's deep learning optimization library for extreme-scale training (ZeRO, offloading, MoE) |
| DeepSpeed | Training & Fine-tuning EcosystemDistributed Training | — | — | TOOL | Extreme-scale training optimizations |
| DeepTeam (Confident AI) | MLOps / LLMOps & ProductionGuardrails & Safety Tools | — | Apache-2.0 | TOOL | Red teaming framework for LLM systems with 50+ vulnerabilities, 20+ adversarial attacks, and production-ready guardrails. Includes OWASP, NIST, and MITRE ATLAS framework mappings. Apache 2.0 licensed |
| Deequ | Core Frameworks & LibrariesData Quality & Validation | — | Apache-2.0 | TOOL | Library built on top of Apache Spark for defining "unit tests for data". Measures data quality in large datasets with constraint verification, anomaly detection, and incremental validation. Used at Amazon for production data quality. Apache 2.0 licensed |
| Deer-Flow (ByteDance) | Agentic AI & Multi-Agent SystemsMulti-Agent Orchestration | — | — | AGENT | Open-source long-horizon SuperAgent harness that researches, codes, and creates. Handles tasks from minutes to hours with sandboxes, memories, tools, skills, subagents, and a message gateway |
| Delta Lake | Core Frameworks & LibrariesData Processing & Manipulation | — | Apache-2.0 | TOOL | Open-source storage framework enabling Lakehouse architecture with ACID transactions, scalable metadata handling, and unified batch/streaming processing. Apache 2.0 licensed |
| Depth Anything V2 | Open Foundation ModelsMultimodal Models (Vision + Language) | — | Apache-2.0 | MODEL | Foundation model for monocular depth estimation trained on 595K synthetic and 62M+ real images. Provides robust, fine-grained depth estimation for any image. Apache 2.0 licensed |
| DesktopCommander MCP | Developer Tools & IntegrationsCLI Tools & API Clients | — | MIT | TOOL | MCP server for Claude providing terminal control, file system search, and diff file editing capabilities. Enables autonomous code editing through Model Context Protocol. MIT licensed |
| Deta Surf | Developer Tools & IntegrationsNotebooks & Interactive Computing | — | Apache-2.0 | TOOL | Personal AI notebook for organizing files and webpages with AI-generated notes. Local-first data storage, open data formats, and open model choice including local models. Cross-platform desktop app for research and thinking workflows. Apache 2.0 licensed |
| Detectron2 | Specialized DomainsComputer Vision | — | — | TOOL | High-performance object detection library |
| Detoxify | AI Safety, Alignment & InterpretabilityAdversarial & Red-teaming Tools | — | Apache-2.0 | TOOL | Trained models and code to predict toxic comments on all 3 Jigsaw Toxic Comment Challenges. Built using PyTorch Lightning and Transformers for toxicity, severe toxicity, obscene, threat, insult, identity attack, and sexually explicit content detection. Apache 2.0 licensed |
| Dia (Nari Labs) | Open Foundation ModelsSpeech & Audio Models (TTS, STT, Music) | — | — | MODEL | 1.6B parameter TTS generating ultra-realistic dialogue in one pass with nonverbal communications (laughter, coughing). Emotion and tone control via audio conditioning |
| Diffrax | Core Frameworks & LibrariesDeep Learning Frameworks | — | Apache-2.0 | TOOL | Numerical differential equation solvers in JAX. Autodifferentiable and GPU-capable ODE/SDE/CDE solvers for scientific machine learning and neural differential equations. Apache 2.0 licensed |
| Diffusers | Generative Media ToolsImage Generation & Editing | — | — | TOOL | PyTorch library for diffusion pipelines spanning image, video, and audio generation |
| Dify | Agentic AI & Multi-Agent SystemsDomain-Specific Agents | — | — | AGENT | Production-ready agentic workflow platform |
| DiskANN (Microsoft) | Retrieval-Augmented Generation (RAG) & KnowledgeVector Databases & Search Engines | — | MIT | TOOL | Graph-structured indices for scalable, fast, fresh and filtered approximate nearest neighbor search. Handles billion-vector datasets on a single node with SSD-based indexing. MIT licensed |
| distilabel | Training & Fine-tuning EcosystemSynthetic Data Generation | — | — | TOOL | End-to-end pipeline for synthetic instruction data |
| distributed-llama | Inference Engines & ServingAdditional Inference Engines | — | MIT | TOOL | Distributed LLM inference connecting home devices into a powerful cluster. More devices means faster inference via tensor parallelism over Ethernet. Supports Linux, macOS, Windows, ARM, and x86_64 AVX2 CPUs. MIT licensed |
| Dive | User Interfaces & Self-hosted PlatformsDesktop & Mobile AI Apps | — | MIT | TOOL | Open-source MCP Host Desktop Application with dual Tauri/Electron architecture. Seamlessly integrates with any LLM that supports function calling. MIT licensed |
| DJL (Deep Java Library) | MLOps / LLMOps & ProductionModel Hubs & Registries | — | Apache-2.0 | TOOL | Engine-agnostic deep learning framework for Java with built-in model zoo. Load and run PyTorch, TensorFlow, MXNet, and ONNX models with a unified API. Includes 80+ pre-trained models for CV and NLP. Apache 2.0 licensed |
| dm-haiku | Core Frameworks & LibrariesDeep Learning Frameworks | — | Apache-2.0 | TOOL | JAX-based neural network library from Google DeepMind. Elegant functional API with state management, widely used in DeepMind's research. Apache 2.0 licensed |
| Doccano | Core Frameworks & LibrariesData Labeling & Annotation | — | MIT | TOOL | Open-source text annotation tool for machine learning practitioners. Features text classification, sequence labeling, and sequence-to-sequence tasks for sentiment analysis, NER, and summarization. MIT licensed |
| DocETL (UC Berkeley) | Retrieval-Augmented Generation (RAG) & KnowledgeDocument Conversion & Preprocessing | — | MIT | TOOL | Agentic LLM-powered data processing and ETL system for complex document processing. Query rewriting and evaluation for unstructured data analysis with 80% higher accuracy than baselines. MIT licensed |
| Docling | Retrieval-Augmented Generation (RAG) & KnowledgeRAG Frameworks & Advanced Retrieval Tools | — | — | TOOL | Document processing toolkit for turning PDFs and other files into structured data for GenAI workflows |
| DocsGPT | Retrieval-Augmented Generation (RAG) & KnowledgeRAG Frameworks & Advanced Retrieval Tools | — | MIT | TOOL | Private AI platform for building intelligent agents and assistants with enterprise search. Features Agent Builder, deep research tools, multi-format document analysis, and multi-model support. MIT licensed |
| Drawdata | Developer Tools & IntegrationsNotebooks & Interactive Computing | — | MIT | TOOL | Draw datasets from within Python notebooks. Interactive data visualization tool for creating and editing datasets directly in Jupyter environments. MIT licensed |
| DSPy | Agentic AI & Multi-Agent SystemsSingle-Agent Frameworks | — | — | AGENT | Framework for programming language model pipelines with modules, optimizers, and evaluation loops |
| dstack | Training & Fine-tuning EcosystemDistributed Training | — | MPL-2.0 | TOOL | Vendor-agnostic orchestration for training, inference and agentic workloads across NVIDIA, AMD, TPU, and Tenstorrent on clouds, Kubernetes, and bare metal. MPL-2.0 licensed |
| DuckDB | Core Frameworks & LibrariesData Processing & Manipulation | — | MIT | TOOL | High-performance analytical in-process SQL database system. Fast, reliable, portable, and easy to use with rich SQL dialect support. Perfect for data processing and analytics workloads. MIT licensed |
| DVC (Data Version Control) | MLOps / LLMOps & ProductionExperiment Tracking & Versioning | — | — | TOOL | Git-like versioning for data and models |
| E2B Code Interpreter | Evaluation, Benchmarks & DatasetsEvaluation Frameworks | — | Apache-2.0 | TOOL | Python & JS/TS SDK for running AI-generated code in secure isolated sandboxes. Essential infrastructure for evaluating code-generating LLMs with safe execution environments. Apache 2.0 licensed |
| E5 (Microsoft) | Retrieval-Augmented Generation (RAG) & KnowledgeEmbedding Models | — | — | TOOL | High-performance text embeddings for retrieval |
| EasyEdit | AI Safety, Alignment & InterpretabilityInterpretability & Explainability | — | MIT | TOOL | Easy-to-use knowledge editing framework for LLMs. Enables precise modification of model knowledge and behavior to correct hallucinations or outdated information. ACL 2024. MIT licensed |
| EasyR1 | Training & Fine-tuning EcosystemFull Training Frameworks | — | Apache-2.0 | TOOL | Efficient, scalable, multi-modality RL training framework based on veRL. Extends veRL to support vision-language models with GRPO algorithm for efficient RL training. Apache 2.0 licensed |
| EchoMimic (Ant Group) | Generative Media ToolsPortrait Animation | — | Apache-2.0 | TOOL | Lifelike audio-driven portrait animations through editable landmark conditioning. High-quality talking head generation with precise lip synchronization and natural head movements. AAAI 2025. Apache 2.0 licensed |
| Eino | Retrieval-Augmented Generation (RAG) & KnowledgeLLM Application Frameworks | — | Apache-2.0 | TOOL | The ultimate LLM/AI application development framework in Go. Drawing from LangChain and Google ADK, designed to follow Go conventions with composable components for chains, agents, and workflows. Apache 2.0 licensed |
| einops | Core Frameworks & LibrariesModel Training & Optimization Utilities | — | — | TOOL | Flexible, powerful tensor operations for readable and reliable code. Supports PyTorch, JAX, TensorFlow, NumPy, MLX |
| Elasticsearch | Retrieval-Augmented Generation (RAG) & KnowledgeVector Databases & Search Engines | — | AGPL-3.0/Elastic-2.0 | TOOL | Distributed search and analytics engine with native k-NN vector search, hybrid search, and dense vector indexing. Industry-standard for full-text search now with powerful semantic search capabilities. AGPL-3.0/Elastic-2.0 dual licensed |
| ELI5 | AI Safety, Alignment & InterpretabilityInterpretability & Explainability | — | MIT | TOOL | Library for debugging/inspecting machine learning classifiers and explaining their predictions. Supports scikit-learn, XGBoost, LightGBM, and more with feature importance and explanation visualizations. MIT licensed |
| elizaOS | Agentic AI & Multi-Agent SystemsMulti-Agent Orchestration | — | — | AGENT | Autonomous multi-agent framework for building and deploying AI-powered applications. Features Discord/Telegram/Farcaster connectors, RAG support, and a modern web dashboard |
| EmbedAnything | Retrieval-Augmented Generation (RAG) & KnowledgeEmbedding Models | — | Apache-2.0 | TOOL | Minimalist, highly performant multimodal embedding pipeline built in Rust. Memory-safe, modular, and production-ready for text, image, and audio embeddings with seamless vector DB integration. Apache 2.0 licensed |
| EmbedChain | Retrieval-Augmented Generation (RAG) & KnowledgeRAG Frameworks & Advanced Retrieval Tools | — | Apache-2.0 | TOOL | Universal memory layer for AI agents. Simple API to create RAG applications over any dataset with support for multiple vector stores, embedding models, and LLM providers. Apache 2.0 licensed |
| Envoy AI Gateway | MLOps / LLMOps & ProductionMonitoring, Evaluation & Observability | — | Apache-2.0 | TOOL | Manages unified access to generative AI services built on Envoy Gateway. Kubernetes-native AI gateway for routing, load balancing, and managing LLM traffic with enterprise-grade reliability. Apache 2.0 licensed |
| Equinox | Core Frameworks & LibrariesDeep Learning Frameworks | — | Apache-2.0 | TOOL | Elegant easy-to-use neural networks and scientific computing in JAX. Callable PyTrees with filtered transformations, seamless interoperability with the JAX ecosystem. Apache 2.0 licensed |
| EvalScope (ModelScope) | Evaluation, Benchmarks & DatasetsEvaluation Frameworks | — | Apache-2.0 | TOOL | Streamlined and customizable framework for efficient large model (LLM, VLM, AIGC) evaluation and performance benchmarking. One-stop evaluation solution with 80+ benchmarks. Apache 2.0 licensed |
| Evidently | MLOps / LLMOps & ProductionMonitoring, Evaluation & Observability | — | — | TOOL | ML & LLM monitoring framework |
| ExecuTorch | Specialized DomainsEdge / On-device AI | — | — | TOOL | PyTorch runtime and toolchain for deploying AI models on mobile, embedded, and edge devices |
| ExLlamaV2 | Inference Engines & ServingQuantization, Distillation & Optimization | — | — | TOOL | Highly optimized CUDA kernels for 4-bit/8-bit inference |
| exo | Inference Engines & ServingLocal / On-device Inference | — | Apache-2.0 | TOOL | Run frontier AI locally by connecting all your devices into an AI cluster. Features automatic device discovery, RDMA over Thunderbolt for 99% latency reduction, topology-aware auto parallel, and tensor parallelism. Uses MLX backend for distributed inference across Apple Silicon devices. Apache 2.0 licensed |
| F5-TTS | Open Foundation ModelsSpeech & Audio Models (TTS, STT, Music) | — | MIT | MODEL | Flow matching-based TTS with fluent and faithful speech synthesis. Zero-shot voice cloning with high naturalness and prosody accuracy. MIT licensed |
| Fairlearn | AI Safety, Alignment & InterpretabilityFairness & Bias Mitigation | — | MIT | TOOL | Python package to assess and improve fairness of machine learning models. Provides metrics for disparity assessment and algorithms for unfairness mitigation with scikit-learn integration. MIT licensed |
| fairseq2 | Core Frameworks & LibrariesNLP & Transformers | — | MIT | TOOL | FAIR Sequence Modeling Toolkit 2. Complete rewrite of fairseq with modern PyTorch APIs, native support for LLM training (70B+ models), vLLM integration, and first-party recipes for instruction finetuning and preference optimization. MIT licensed |
| Faiss | Retrieval-Augmented Generation (RAG) & KnowledgeVector Databases & Search Engines | — | — | TOOL | Similarity search and clustering library for dense vectors with CPU and GPU implementations |
| fastai | Core Frameworks & LibrariesModel Training & Optimization Utilities | — | Apache-2.0 | TOOL | Deep learning library providing practitioners with high-level components for state-of-the-art results. Built on PyTorch with a focus on usability and transfer learning. Apache 2.0 licensed |
| FastChat | Training & Fine-tuning EcosystemFull Training Frameworks | — | Apache-2.0 | TOOL | Open platform for training, serving, and evaluating large language model chatbots. Powers Chatbot Arena (lmarena.ai) serving 10M+ requests for 70+ LLMs. Includes training code for Vicuna, MT-Bench evaluation, and distributed multi-model serving with OpenAI-compatible APIs. Apache 2.0 licensed |
| FastEmbed (Qdrant) | Retrieval-Augmented Generation (RAG) & KnowledgeEmbedding Models | — | Apache-2.0 | TOOL | Lightweight, fast Python library for embedding generation with ONNX Runtime. Supports text, sparse (SPLADE), and late-interaction (ColBERT) embeddings without GPU dependencies. Apache 2.0 licensed |
| faster-whisper (SYSTRAN) | Open Foundation ModelsSpeech & Audio Models (TTS, STT, Music) | — | — | MODEL | Reimplementation of Whisper using CTranslate2 for up to 4x faster inference with same accuracy. Supports batched processing and 8-bit quantization |
| FastGPT | Retrieval-Augmented Generation (RAG) & KnowledgeRAG Frameworks & Advanced Retrieval Tools | — | — | TOOL | Knowledge-base platform with RAG retrieval, document processing, visual AI workflows, and self-hosted deployment options |
| Feast | MLOps / LLMOps & ProductionExperiment Tracking & Versioning | — | Apache-2.0 | TOOL | Open source feature store for ML. Manages offline/online feature storage with point-in-time correctness to prevent data leakage. Apache 2.0 licensed |
| Feature-engine | MLOps / LLMOps & ProductionFeature Engineering & Data Preparation | — | BSD-3-Clause | TOOL | Python library with multiple transformers to engineer and select features for machine learning models. scikit-learn compatible with fit() and transform() methods for encoding, imputation, variable transformation, and feature selection. BSD-3-Clause licensed |
| Featuretools | MLOps / LLMOps & ProductionFeature Engineering & Data Preparation | — | BSD-3-Clause | TOOL | Open-source Python library for automated feature engineering. Transforms transactional and relational datasets into feature matrices for machine learning using Deep Feature Synthesis with reusable primitives. BSD-3-Clause licensed |
| Fern | Developer Tools & IntegrationsSDKs & API Development Tools | — | Apache-2.0 | TOOL | Open-source SDK generator for REST APIs. Generate type-safe API clients in TypeScript, Python, Go, Java, and more from OpenAPI specs. Powers SDKs for companies like OpenAI, Anthropic, and Cloudflare. Apache 2.0 licensed |
| FiftyOne | Core Frameworks & LibrariesData Processing & Manipulation | — | Apache-2.0 | TOOL | Visual AI development toolkit for visualizing, labeling, and evaluating visual datasets and models. Supercharges computer vision workflows with dataset exploration and model analysis. Apache 2.0 licensed |
| Finetrainers | Specialized DomainsGame AI & Simulations | — | Apache-2.0 | TOOL | Scalable and memory-optimized training of diffusion models from Hugging Face. Supports LoRA and full fine-tuning for video and image generation models. Apache 2.0 licensed |
| FineWeb / FineWeb-2 (Hugging Face) | Evaluation, Benchmarks & DatasetsHigh-quality Open Datasets & Data Tools | — | — | TOOL | Curated 15T+ token web dataset for pre-training |
| FinGPT | Specialized DomainsFinance & Quantitative AI | — | MIT | TOOL | Open-source financial large language models. Democratizing financial AI with data-centric training pipeline and multiple model releases for trading, analysis, and robo-advising. MIT licensed |
| FinRL | Specialized DomainsFinance & Quantitative AI | — | MIT | TOOL | Financial reinforcement learning framework for quantitative trading. Deep RL library for stock trading, portfolio allocation, and market execution with pre-built environments and benchmarks. MIT licensed |
| FinRobot | Specialized DomainsFinance & Quantitative AI | — | Apache-2.0 | TOOL | Open-source AI agent platform for financial analysis using LLMs. Multi-agent system with specialized agents for trading, analysis, and research. Apache 2.0 licensed |
| Firecrawl | Retrieval-Augmented Generation (RAG) & KnowledgeWeb Data Ingestion | — | — | TOOL | Web Data API for AI - search, scrape, and interact with the web at scale. Clean markdown/JSON output with proxy rotation and JS-blocking handled automatically |
| Fish Speech / StyleTTS 2 | Open Foundation ModelsSpeech & Audio Models (TTS, STT, Music) | — | — | MODEL | Zero-shot TTS with excellent voice cloning. Extremely popular in 2026 |
| FLAML | Core Frameworks & LibrariesAutoML & Hyperparameter Optimization | — | — | TOOL | Microsoft's fast & lightweight AutoML focused on efficiency and low compute |
| FlashAttention | Core Frameworks & LibrariesModel Training & Optimization Utilities | — | — | TOOL | Fast exact attention kernels that reduce memory usage and accelerate transformer training and inference |
| FlashInfer | Inference Engines & ServingAdditional Inference Engines | — | Apache-2.0 | TOOL | Kernel library for LLM serving. High-performance CUDA kernels for attention, sampling, and matrix multiplication. Powers vLLM, SGLang, and other inference engines with optimized GPU kernels. Apache 2.0 licensed |
| FlashRAG | Retrieval-Augmented Generation (RAG) & KnowledgeRAG Frameworks & Advanced Retrieval Tools | — | MIT | TOOL | Efficient toolkit for RAG research with 40+ retrieval and reranking models, 20+ benchmark datasets, and optimized evaluation pipelines (WWW 2025 Resource). MIT licensed |
| Flowise | User Interfaces & Self-hosted PlatformsFull Self-hosted AI Platforms | — | — | TOOL | Drag-and-drop LLM app builder |
| Flux.jl | Core Frameworks & LibrariesJulia ML Frameworks | Julia | — | TOOL | 100% pure-Julia ML stack with lightweight abstractions on top of native GPU and AD support. Elegant, hackable, and fully integrated with Julia's scientific computing ecosystem |
| FluxGym | Training & Fine-tuning EcosystemFull Training Frameworks | — | GPL-3.0 | TOOL | Dead-simple FLUX LoRA training UI with low-VRAM support (12GB/16GB/20GB). WebUI forked from AI-Toolkit with backend powered by Kohya Scripts. Combines the simplicity of a Gradio interface with the flexibility of Kohya's training scripts. GPL-3.0 licensed |
| Flyte | MLOps / LLMOps & ProductionDeployment & Orchestration | — | Apache-2.0 | TOOL | Kubernetes-native workflow orchestration platform for AI/ML pipelines. Dynamic, resilient orchestration with strong type safety and reproducibility. Used by Lyft, Spotify, and Gojek. Apache 2.0 licensed |
| Fooocus | Generative Media ToolsImage Generation & Editing | — | — | TOOL | Midjourney-style UI with beautiful out-of-the-box results |
| FunASR | Open Foundation ModelsSpeech & Audio Models (TTS, STT, Music) | — | MIT | MODEL | Fundamental end-to-end speech recognition toolkit with SOTA pretrained models. Supports ASR, VAD, speaker verification, diarization, and multi-talker ASR. Industrial-grade with 31-language support and real-time transcription services. MIT licensed |
| GAIA | Evaluation, Benchmarks & DatasetsBenchmark Suites | — | — | BENCHMARK | Real-world multi-step agentic benchmark |
| Garak (NVIDIA) | MLOps / LLMOps & ProductionGuardrails & Safety Tools | — | Apache-2.0 | TOOL | The LLM vulnerability scanner. Probes models for hallucinations, data leakage, prompt injection, misinformation, toxicity, and jailbreaks. Extensive plugin-based architecture with 100+ vulnerability probes. Apache 2.0 licensed |
| Gemini CLI (Google) | Agentic AI & Multi-Agent SystemsAutonomous Coding Agents | — | Apache-2.0 | AGENT | Open-source AI agent that brings Gemini's power directly into your terminal. Supports code generation, shell execution, and file editing with full Apache 2.0 licensing |
| Gemma 4 (Google) | Open Foundation ModelsLarge Language Models (Base + Chat) | — | Apache-2.0 | MODEL | Released April 2026 in four sizes (E2B, E4B, 26B MoE, 31B Dense). First major update in a year, released under an Apache 2.0 license and aimed at complex logic and agentic workflows |
| gemma.cpp | Inference Engines & ServingAdditional Inference Engines | — | Apache-2.0 | TOOL | Lightweight, standalone C++ inference engine for Google's Gemma models. Optimized for on-device deployment with minimal dependencies and efficient memory usage. Apache 2.0 licensed |
| Generative AI for Beginners (Microsoft) | Resources & LearningCourses & Interactive Playgrounds | — | — | TOOL | 21 lessons covering generative AI fundamentals, prompt engineering, RAG applications, fine-tuning, and LLM app deployment with practical exercises |
| Genkit | Retrieval-Augmented Generation (RAG) & KnowledgeLLM Application Frameworks | — | Apache-2.0 | TOOL | Open-source framework for building full-stack AI-powered applications in JavaScript, Go, and Python. Built and used in production by Google's Firebase. Unified interface for integrating AI models from multiple providers with built-in RAG, tool calling, structured outputs, and developer tools. Apache 2.0 licensed |
| GEPA | Developer Tools & IntegrationsPrompt Engineering & Management | — | MIT | TOOL | Reflective prompt evolution optimizer using natural language reflection and Pareto frontier learning. Outperforms reinforcement learning for prompt optimization. Integrated with DSPy and MLflow. MIT licensed |
| GGML | Core Frameworks & LibrariesDeep Learning Frameworks | — | MIT | TOOL | Tensor library for machine learning. The foundational C/C++ library powering llama.cpp and many on-device inference engines. MIT licensed |
| Giskard | MLOps / LLMOps & ProductionMonitoring, Evaluation & Observability | — | Apache-2.0 | TOOL | Open-source evaluation and testing library for LLM agents. Red teaming, vulnerability scanning, RAG evaluation, and safety testing with modular architecture. Apache 2.0 licensed |
| GitHub Copilot SDK | Developer Tools & IntegrationsSDKs & API Development Tools | — | MIT | TOOL | Multi-platform SDK for integrating GitHub Copilot Agent into apps and services. Production-tested agent runtime with planning, tool invocation, and context management. Build Copilot-style agents without writing your own orchestration. MIT licensed |
| GitIngest | Developer Tools & IntegrationsCLI Tools & API Clients | — | MIT | TOOL | Replace 'hub' with 'ingest' in any GitHub URL to get a prompt-friendly extract of a codebase. Optimized for Python ecosystem and data science workflows. MIT licensed |
| Gitpod | Developer Tools & IntegrationsAI-Native IDEs & Development Environments | — | AGPL-3.0 | TOOL | Cloud development environment platform with automated prebuilds, ephemeral workspaces, and support for any IDE. Self-hostable with open-source core. AGPL-3.0 licensed |
| GLM-4.5V / GLM-4.1V-Thinking (Zhipu AI) | Open Foundation ModelsMultimodal Models (Vision + Language) | — | — | MODEL | Strong multimodal reasoning with scalable reinforcement learning. Compares favorably with Gemini-2.5-Flash on benchmarks |
| GLM-5 (Zhipu AI) | Open Foundation ModelsLarge Language Models (Base + Chat) | — | — | MODEL | Strong open model line with solid coding, reasoning, and agentic-task performance |
| GluonTS (AWS Labs) | Specialized DomainsTime Series & Scientific AI | — | Apache-2.0 | TOOL | Probabilistic time series modeling with deep learning. Powers Amazon SageMaker forecasting with PyTorch and MXNet backends. Apache 2.0 licensed |
| Goose | Agentic AI & Multi-Agent SystemsAutonomous Coding Agents | — | — | AGENT | Extensible on-machine AI agent for development tasks |
| GPT-NeoX-20B (EleutherAI) | Open Foundation ModelsLarge Language Models (Base + Chat) | — | Apache-2.0 | MODEL | 20B parameter autoregressive language model trained on the Pile dataset. One of the largest dense open-source models with publicly available weights at release. Complete training codebase with distributed training support. Apache 2.0 licensed |
| GPT-OSS (OpenAI) | Open Foundation ModelsLarge Language Models (Base + Chat) | — | Apache-2.0 | MODEL | OpenAI's first open-weight models since GPT-2 (120B and 20B MoE). Apache 2.0 licensed with state-of-the-art performance for their size class. Released August 2025 |
| GPT-RAG (Azure) | Retrieval-Augmented Generation (RAG) & KnowledgeRAG Frameworks & Advanced Retrieval Tools | — | MIT | TOOL | Enterprise RAG pattern for Azure OpenAI at scale. Secure, production-ready architecture using Azure Cognitive Search and Azure OpenAI LLMs for ChatGPT-style Q&A experiences. MIT licensed |
| gpt-researcher | Agentic AI & Multi-Agent SystemsDomain-Specific Agents | — | Apache-2.0 | AGENT | Autonomous agent that conducts deep online research on any topic. Generates comprehensive reports with citations by orchestrating web searches, content scraping, and synthesis. Apache 2.0 licensed |
| GPT-SoVITS | Generative Media ToolsAudio / Music / Voice Generation | — | MIT | TOOL | Few-shot voice cloning with just 1 minute of voice data. Combines GPT and SoVITS architectures for high-quality TTS with cross-lingual support and emotional expression. MIT licensed |
| gptme | Agentic AI & Multi-Agent SystemsAutonomous Coding Agents | — | MIT | AGENT | Your agent in your terminal, equipped with local tools: writes code, uses the terminal, browses the web. Make your own persistent autonomous agent on top. MIT licensed |
| GPUStack | Inference Engines & ServingHigh-performance Serving & API Servers | — | — | TOOL | GPU cluster manager that orchestrates inference engines like vLLM and SGLang. Automated engine selection, parameter optimization, and distributed multi-GPU deployment for high-performance AI workloads |
| Gradio | Core Frameworks & LibrariesInteractive ML Apps & Notebooks | — | — | TOOL | Build and share delightful machine learning apps, all in Python. The de facto standard for creating interactive ML demos with automatic UI generation from function signatures. Powers thousands of Hugging Face Spaces |
| GraphCast | Specialized DomainsWeather & Climate AI | — | Apache-2.0 | TOOL | Deep learning weather forecasting model from Google DeepMind. State-of-the-art AI weather prediction with 10-day global forecasts matching or exceeding traditional numerical methods. Apache 2.0 licensed |
| Graphiti | Retrieval-Augmented Generation (RAG) & KnowledgeKnowledge Graphs for RAG | — | Apache-2.0 | TOOL | Build real-time temporal knowledge graphs for AI agents. Tracks how facts change over time with provenance to source data. Supports prescribed and learned ontology for evolving real-world data. Apache 2.0 licensed |
| GraphRAG (Microsoft) | Retrieval-Augmented Generation (RAG) & KnowledgeRAG Frameworks & Advanced Retrieval Tools | — | — | TOOL | Knowledge-graph-based RAG |
| Great Expectations | Core Frameworks & LibrariesData Quality & Validation | — | Apache-2.0 | TOOL | Always know what to expect from your data. Data validation, profiling, and documentation for data pipelines. Apache 2.0 licensed |
| Griptape | Agentic AI & Multi-Agent SystemsSingle-Agent Frameworks | — | Apache-2.0 | AGENT | Modular Python framework for AI agents and workflows with chain-of-thought reasoning, tools, and memory. Enforces structures like sequential pipelines and DAG-based workflows for predictable AI systems. Apache 2.0 licensed |
| gsplat (3D Gaussian Splatting tools) | Generative Media Tools3D & Creative Tools | — | — | TOOL | High-performance 3D Gaussian Splatting library |
| Guardrails AI | AI Safety, Alignment & InterpretabilityAdversarial & Red-teaming Tools | — | Apache-2.0 | TOOL | Input/output validation framework for building reliable AI applications. Detects and mitigates risks through composable validators for PII, toxicity, prompt injection, and structured output validation. Features Guardrails Hub with 50+ pre-built validators. Apache 2.0 licensed |
| Guidance | Agentic AI & Multi-Agent SystemsPrompt Engineering & Structured Outputs | — | MIT | AGENT | Efficient programming paradigm for steering language models. Control output structure with loops, conditionals, and regex constraints inline. Reduces latency and cost vs conventional prompting. MIT licensed |
| Gymnasium (ex-OpenAI Gym) | Specialized DomainsReinforcement Learning & Robotics | — | — | TOOL | Standard RL environment API |
| H2O LLM Studio | Training & Fine-tuning EcosystemFull Training Frameworks | — | Apache-2.0 | TOOL | No-code GUI framework for fine-tuning LLMs. Streamlined interface for SFT, reward modeling, and model deployment. Apache 2.0 licensed |
| Habitat-Sim | Specialized DomainsAutonomous Driving & Robotics Simulators | — | MIT | TOOL | High-performance physics-enabled 3D simulator for embodied AI research. Supports 3D scans of indoor/outdoor spaces, CAD models, and configurable sensors. Powers Meta's embodied AI research. MIT licensed |
| HAMi | MLOps / LLMOps & ProductionDeployment & Orchestration | — | Apache-2.0 | TOOL | Heterogeneous GPU Sharing on Kubernetes. CNCF sandbox project providing GPU virtualization, slicing, and scheduling for efficient AI workload management across heterogeneous accelerators (GPUs, NPUs, MLUs). Apache 2.0 licensed |
| Hamilton | Core Frameworks & LibrariesData Processing & Manipulation | — | Apache-2.0 | TOOL | Declarative dataflow framework for building testable, modular, self-documenting data pipelines. Encode lineage and metadata directly in Python functions. Originally from Stitch Fix, now Apache incubating. Apache 2.0 licensed |
| Harbor | Evaluation, Benchmarks & DatasetsEvaluation Frameworks | — | Apache-2.0 | TOOL | Framework for running agent evaluations and creating/using RL environments. Evaluate arbitrary agents like Claude Code, OpenHands, and Codex CLI. Build and share benchmarks and environments. Apache 2.0 licensed |
| Haystack | Retrieval-Augmented Generation (RAG) & KnowledgeRAG Frameworks & Advanced Retrieval Tools | — | — | TOOL | End-to-end NLP and RAG framework |
| Helicone | Developer Tools & IntegrationsPrompt Engineering & Management | — | Apache-2.0 | TOOL | Open-source LLM observability platform with prompt management, versioning, and experimentation. One-line integration, YC W23 company. Apache 2.0 licensed |
| Helicone | MLOps / LLMOps & ProductionMonitoring, Evaluation & Observability | — | — | TOOL | Open-source LLM observability with request logging, caching, rate limiting, and cost analytics |
| Helios (PKU-YuanGroup) | Generative Media ToolsVideo Generation | — | Apache-2.0 | TOOL | Efficient long-video generation framework with 24GB VRAM support for up to 10,000 frames (5+ minutes) and 1280×768 resolution. Apache 2.0 licensed |
| HelixDB | Retrieval-Augmented Generation (RAG) & KnowledgeVector Databases & Search Engines | — | — | TOOL | Graph-vector database for retrieval systems that need relationship traversal alongside semantic search |
| HELM (Stanford) | Evaluation, Benchmarks & DatasetsBenchmark Suites | — | — | BENCHMARK | Holistic Evaluation of Language Models |
| Hermes Agent (NousResearch) | Agentic AI & Multi-Agent SystemsSingle-Agent Frameworks | — | — | AGENT | The agent that grows with you. Autonomous server-side agent with persistent memory that learns and improves over time |
| Higress (Alibaba) | Inference Engines & ServingHigh-performance Serving & API Servers | — | Apache-2.0 | TOOL | AI-native API gateway born from Alibaba's internal infrastructure with 2+ years of production validation. Provides unified LLM API and MCP (Model Context Protocol) management with enterprise-grade 99.99% availability. Apache 2.0 licensed |
| Hindsight | Agentic AI & Multi-Agent SystemsAgent Memory & State | — | MIT | AGENT | State-of-the-art long-term memory for AI agents by Vectorize. Fully self-hosted, MIT-licensed, with integrations for LangChain, CrewAI, LlamaIndex, Vercel AI SDK, and more |
| Hive (Aden) | Agentic AI & Multi-Agent SystemsMulti-Agent Orchestration | — | Apache-2.0 | AGENT | Production-grade multi-agent orchestration framework with 10K+ stars. Apache 2.0 licensed |
| hnswlib | Retrieval-Augmented Generation (RAG) & KnowledgeVector Databases & Search Engines | — | Apache-2.0 | TOOL | Header-only C++ library for fast approximate nearest neighbors with Python bindings. Supports CRUD operations and concurrent read/write, which is unique among ANN libraries. Powers many production vector databases. Apache 2.0 licensed |
| Homemade Machine Learning (trekhleb) | Resources & LearningEducational Resources & Courses | — | MIT | TOOL | Python examples of popular machine learning algorithms with interactive Jupyter demos and mathematical explanations. Educational resource for understanding ML from scratch with visualizations. MIT licensed |
| Hugging Face Accelerate | Core Frameworks & LibrariesModel Training & Optimization Utilities | — | — | TOOL | Simple API to make training scripts run on any hardware (multi-GPU, TPU, mixed precision) with minimal code changes |
| Hugging Face Course | Resources & LearningCourses & Interactive Playgrounds | — | — | TOOL | Free hands-on courses using only open models |
| Hugging Face Datasets | Evaluation, Benchmarks & DatasetsHigh-quality Open Datasets & Data Tools | — | — | TOOL | Largest open repository of datasets |
| Hugging Face Discussions | Resources & LearningCommunities, Forums & Newsletters | — | — | TOOL | Largest open AI forum |
| Hugging Face Evaluate | Evaluation, Benchmarks & DatasetsEvaluation Frameworks | — | — | TOOL | Standardized evaluation metrics |
| Hugging Face Hub | MLOps / LLMOps & ProductionModel Hubs & Registries | — | Apache-2.0 | TOOL | Official Python client for the Hugging Face Hub. Download, upload, and manage 1M+ open-source ML models and datasets programmatically. The de facto standard for model sharing and distribution. Apache 2.0 licensed |
| Hugging Face Papers | Resources & LearningPapers with Open Implementations | — | — | TOOL | Daily-updated feed of the latest arXiv papers with open weights |
| Hugging Face Transformers Notebooks | Resources & LearningStarter Projects & Examples | — | — | TOOL | Run Transformers, Datasets, and more in Colab |
| HuggingChat (self-hosted) | User Interfaces & Self-hosted PlatformsLocal AI Chat UIs & Personal Assistants | — | — | TOOL | Official open-source codebase for HuggingChat |
| HunyuanVideo (Tencent) | Generative Media ToolsVideo Generation | — | — | TOOL | 13B-parameter systematic video generation framework. Leading quality among open models |
| Ibis | Core Frameworks & LibrariesData Processing & Manipulation | — | Apache-2.0 | TOOL | Portable Python dataframe library with 20+ backends. Write Pandas-like code that runs locally with DuckDB or scales to production databases (BigQuery, Snowflake, PostgreSQL) by changing one line. Apache 2.0 licensed |
| II-Agent (Intelligent Internet) | Agentic AI & Multi-Agent SystemsSingle-Agent Frameworks | — | Apache-2.0 | AGENT | New open-source framework to build and deploy intelligent agents with support for Claude, Gemini, and OpenAI models. Apache 2.0 licensed |
| ik_llama.cpp | Inference Engines & ServingAdditional Inference Engines | — | MIT | TOOL | High-performance llama.cpp fork with better CPU and hybrid GPU/CPU performance, SOTA quantization types, first-class Bitnet support, and improved DeepSeek performance via MLA, FlashMLA, and fused MoE operations. MIT licensed |
| Infinity (AI Database) | Retrieval-Augmented Generation (RAG) & KnowledgeRAG Frameworks & Advanced Retrieval Tools | — | Apache-2.0 | TOOL | AI-native database built for LLM applications with incredibly fast hybrid search of dense vector, sparse vector, tensor (multi-vector), and full-text. Powers RAGFlow's document engine. Apache 2.0 licensed |
| Infinity (Embeddings Server) | Retrieval-Augmented Generation (RAG) & KnowledgeRAG Frameworks & Advanced Retrieval Tools | — | — | TOOL | High-throughput, low-latency serving engine for text-embeddings, reranking, CLIP, and ColPali. OpenAI-compatible API |
| Inspect AI | Evaluation, Benchmarks & DatasetsEvaluation Frameworks | — | — | TOOL | Framework for large language model evaluations from the UK AI Security Institute |
| Instructor | Developer Tools & IntegrationsCLI Tools & API Clients | — | MIT | TOOL | Python library for extracting structured, validated data from LLMs using Pydantic models. Handles validation, retries, and error handling with 15+ provider support. MIT licensed |
| interpret (Microsoft) | AI Safety, Alignment & InterpretabilityInterpretability & Explainability | — | — | TOOL | Fit interpretable models and explain blackbox machine learning with state-of-the-art explainability techniques including Explainable Boosting Machines and SHAP-based explanations |
| InvokeAI | Generative Media ToolsImage Generation & Editing | — | — | TOOL | Full-featured creative studio |
| IREE | Core Frameworks & LibrariesHigh-Performance Compute Libraries | — | Apache-2.0 | TOOL | Retargetable MLIR-based machine learning compiler and runtime toolkit. Lowers ML models to unified IR that scales from datacenter to mobile and edge deployments. Apache 2.0 licensed |
| Isaac Lab | Specialized DomainsReinforcement Learning & Robotics | — | — | TOOL | GPU-accelerated robot learning framework |
| Jan | User Interfaces & Self-hosted PlatformsDesktop & Mobile AI Apps | — | — | TOOL | Local-first AI app framework |
| JAX | Core Frameworks & LibrariesDeep Learning Frameworks | — | — | TOOL | High-performance numerical computing with composable transformations (JIT, vmap, grad). Rising favorite for research and scientific ML |
| Jido | Agentic AI & Multi-Agent SystemsSingle-Agent Frameworks | — | Apache-2.0 | AGENT | Autonomous agent framework for Elixir. Built for distributed, autonomous behavior and dynamic workflows with actor-model concurrency. Apache 2.0 licensed |
| Julep | Agentic AI & Multi-Agent SystemsDomain-Specific Agents | — | — | AGENT | Stateful agent workflow platform with memory, tools, branching, and long-running task execution |
| Jupyter AI | Developer Tools & IntegrationsIDE Plugins & Extensions | — | — | TOOL | Chat and code generation inside notebooks |
| JVector (DataStax) | Retrieval-Augmented Generation (RAG) & KnowledgeVector Databases & Search Engines | — | Apache-2.0 | TOOL | The most advanced embedded vector search engine for Java. DiskANN-based algorithm for billion-scale vector search with efficient memory mapping. Apache 2.0 licensed |
| KAG (OpenSPG) | Retrieval-Augmented Generation (RAG) & KnowledgeRAG Frameworks & Advanced Retrieval Tools | — | Apache-2.0 | TOOL | Knowledge Augmented Generation framework for logical reasoning and factual Q&A in professional domains. Builds on OpenSPG knowledge graph engine to overcome traditional RAG vector similarity limitations. Supports multi-hop reasoning with schema-constrained knowledge construction. Apache 2.0 licensed |
| KaibanJS | Agentic AI & Multi-Agent SystemsSingle-Agent Frameworks | — | MIT | AGENT | JavaScript-native framework for building and managing multi-agent systems with a Kanban-inspired approach. Visual task board for AI agents with real-time collaboration features. MIT licensed |
| Katib (Kubeflow) | Core Frameworks & LibrariesAutoML & Hyperparameter Optimization | — | Apache-2.0 | TOOL | Kubernetes-native AutoML for hyperparameter tuning, early stopping, and neural architecture search. Framework-agnostic with support for TensorFlow, PyTorch, XGBoost, and custom training operators. Apache 2.0 licensed |
| Kedro | MLOps / LLMOps & ProductionFeature Engineering & Data Preparation | — | Apache-2.0 | TOOL | Toolbox for production-ready data science. Uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable, and modular. Apache 2.0 licensed |
| Keras | Core Frameworks & LibrariesDeep Learning Frameworks | — | — | TOOL | High-level, beginner-friendly API that now runs on multiple backends (TensorFlow, JAX, PyTorch). Perfect for rapid experimentation |
| Kernel Memory (Microsoft) | Retrieval-Augmented Generation (RAG) & KnowledgeRAG Frameworks & Advanced Retrieval Tools | — | MIT | TOOL | Memory solution for users, teams, and applications. RAG pipelines with document ingestion, vector indexing, and natural language querying with citations. Supports multiple LLM providers and vector stores. MIT licensed |
| Kestra | MLOps / LLMOps & ProductionDeployment & Orchestration | — | Apache-2.0 | TOOL | Event-driven orchestration and scheduling platform for mission-critical workflows. Infrastructure-as-Code approach with declarative YAML, Git version control integration, and hundreds of plugins for data pipelines and ML workflows. Apache 2.0 licensed |
| Khoj | User Interfaces & Self-hosted PlatformsLocal AI Chat UIs & Personal Assistants | — | — | TOOL | Self-hostable personal AI assistant for search, chat, automation, and workflows over local and web data |
| Kilo Code | Agentic AI & Multi-Agent SystemsAutonomous Coding Agents | — | — | AGENT | Open-source agentic coding assistant with IDE workflows, tool use, and support for local or OpenAI-compatible models |
| Kimi CLI | Developer Tools & IntegrationsAI Coding Assistants (open-source) | — | Apache-2.0 | TOOL | Kimi Code CLI agent from Moonshot AI. Terminal-based coding assistant with advanced context understanding and multi-file editing capabilities. Apache 2.0 licensed |
| Kimi K2 (Moonshot AI) | Open Foundation ModelsLarge Language Models (Base + Chat) | — | — | MODEL | State-of-the-art 1T parameter MoE model with 32B activated parameters and 128K context. Trained with Muon optimizer for exceptional reasoning and coding performance |
| Kimi K2.5 (Moonshot AI) | Open Foundation ModelsLarge Language Models (Base + Chat) | — | — | MODEL | Frontier open-weight MoE model with 256K context, strong coding and reasoning performance, and native multimodal + tool-use support for agentic workflows |
| KitOps | MLOps / LLMOps & ProductionDeployment & Orchestration | — | Apache-2.0 | TOOL | CNCF open source DevOps tool for packaging, versioning, and securely sharing AI/ML models, datasets, code, and configuration. Packages everything into OCI artifacts stored in existing container registries. Apache 2.0 licensed |
| KoboldCpp | Inference Engines & ServingLocal / On-device Inference | — | — | TOOL | User-friendly llama.cpp fork focused on role-playing and creative writing |
| kohya_ss | Training & Fine-tuning EcosystemFull Training Frameworks | — | — | TOOL | Gradio-based GUI and CLI for training Stable Diffusion models (LoRA, Dreambooth, fine-tuning, SDXL). Provides accessible interface to Kohya's powerful training scripts |
| Kornia | Specialized DomainsComputer Vision | — | — | TOOL | Differentiable computer vision library |
| Kotaemon (Cinnamon) | Retrieval-Augmented Generation (RAG) & KnowledgeRAG Frameworks & Advanced Retrieval Tools | — | Apache-2.0 | TOOL | Open-source RAG-based tool for chatting with your documents. Hybrid RAG pipeline with full-text and vector retriever, re-ranking, and multi-modal capabilities. Clean Gradio-based UI with support for local and API-based LLMs. Apache 2.0 licensed |
| Krita AI Diffusion | Generative Media ToolsImage Generation & Editing | — | GPL-3.0 | TOOL | Streamlined AI image generation plugin for Krita. Inpaint and outpaint with optional text prompt, no tweaking required. Integrates ComfyUI backend for professional digital painting workflows. GPL-3.0 licensed |
| KServe | MLOps / LLMOps & ProductionDeployment & Orchestration | — | — | TOOL | Kubernetes-based model serving |
| KTransformers | Inference Engines & ServingHigh-performance Serving & API Servers | — | — | TOOL | Flexible framework for heterogeneous CPU-GPU LLM inference and fine-tuning. Enables running large MoE models by offloading experts to CPU with BF16/FP8 precision support |
| Kubeflow | MLOps / LLMOps & ProductionDeployment & Orchestration | — | — | TOOL | Kubernetes-native ML/LLM platform |
| Kubeflow Pipelines | MLOps / LLMOps & ProductionDeployment & Orchestration | — | Apache-2.0 | TOOL | Machine Learning Pipelines for Kubeflow. Platform for building and deploying portable, scalable ML workflows using Kubernetes and Argo. Apache 2.0 licensed |
| Kueue | MLOps / LLMOps & ProductionDeployment & Orchestration | — | Apache-2.0 | TOOL | Kubernetes-native job queueing system for batch, HPC, AI/ML, and similar applications. Cloud-native job queueing with resource flavor fungibility, fair sharing, cohorts, and preemption policies. Integrates with Kubeflow, Ray, and JobSet. Apache 2.0 licensed |
| Label Studio | Core Frameworks & LibrariesData Processing & Manipulation | — | Apache-2.0 | TOOL | Multi-type data labeling and annotation tool with standardized output format. Configurable interface for images, text, audio, video, and time series with ML-assisted labeling. Apache 2.0 licensed |
| lakeFS | Core Frameworks & LibrariesData Processing & Manipulation | — | Apache-2.0 | TOOL | Data version control for your data lake that transforms object storage into Git-like repositories. Enables atomic, versioned data lake operations with branching, committing, and merging for data pipelines. Apache 2.0 licensed |
| LanceDB | Retrieval-Augmented Generation (RAG) & KnowledgeVector Databases & Search Engines | — | — | TOOL | Serverless vector DB optimized for multimodal data |
| LangChain | Agentic AI & Multi-Agent SystemsSingle-Agent Frameworks | — | — | AGENT | Foundational library for agents, chains, and memory |
| LangChain Academy | Resources & LearningCourses & Interactive Playgrounds | — | — | TOOL | Free courses on agents and RAG |
| LangChain.rb | Retrieval-Augmented Generation (RAG) & KnowledgeLLM Application Frameworks | — | MIT | TOOL | Build LLM-powered applications in Ruby. Idiomatic Ruby library for building AI applications with support for multiple LLM providers, vector stores, and RAG pipelines. MIT licensed |
| LangChain4j | Retrieval-Augmented Generation (RAG) & KnowledgeRAG Frameworks & Advanced Retrieval Tools | — | Apache-2.0 | TOOL | Java library for integrating LLMs into Java applications. Implements RAG, tool calling (including MCP support), and agents with seamless integration into enterprise Java frameworks like Spring Boot. Apache 2.0 licensed |
| Langflow | Agentic AI & Multi-Agent SystemsDomain-Specific Agents | — | — | AGENT | Visual low-code platform for agentic workflows |
| Langfuse | MLOps / LLMOps & ProductionMonitoring, Evaluation & Observability | — | — | TOOL | #1 open-source LLM observability platform |
| LangGPT | Agentic AI & Multi-Agent SystemsPrompt Engineering & Structured Outputs | — | Apache-2.0 | AGENT | Pioneering framework for structured and meta-prompt design. Battle-tested by thousands of users worldwide with 10,000+ stars. The most popular prompt engineering paradigm for creating reusable, maintainable prompt templates. Apache 2.0 licensed |
| LangGraph | Agentic AI & Multi-Agent SystemsSingle-Agent Frameworks | — | — | AGENT | Stateful, controllable agent orchestration |
| Langroid | Agentic AI & Multi-Agent SystemsSingle-Agent Frameworks | — | MIT | AGENT | Harness LLMs with multi-agent programming. Mature tool calling system based on Pydantic, supports hundreds of LLM providers including OpenAI and local servers. Built for robust agent behavior in real-world use cases. MIT licensed |
| Large Language Model Notebooks Course | Resources & LearningCourses & Interactive Playgrounds | — | MIT | TOOL | Practical hands-on course about Large Language Models and their applications. Covers Chatbots, Code Generation, OpenAI API, Hugging Face, Vector databases, LangChain, Fine Tuning, PEFT, LoRA, QLoRA. MIT licensed |
| Latitude | MLOps / LLMOps & ProductionMonitoring, Evaluation & Observability | — | LGPL-3.0 | TOOL | Open-source agent engineering platform with prompt management, evaluations, and optimization. Features prompt playground, LLM-as-judge evals, and GEPA prompt optimizer for production LLM features. LGPL-3.0 licensed |
| Learn PyTorch for Deep Learning (Zero to Mastery) | Resources & LearningCourses & Interactive Playgrounds | — | — | TOOL | Comprehensive PyTorch deep learning course with hundreds of exercises and real-world projects |
| Leon | User Interfaces & Self-hosted PlatformsLocal AI Chat UIs & Personal Assistants | — | MIT | TOOL | Your open-source personal assistant. Built around tools, context, memory, and agentic execution. Self-hosted, privacy-focused, and extensible. MIT licensed |
| LeRobot | Training & Fine-tuning EcosystemFull Training Frameworks | — | Apache-2.0 | TOOL | Making AI for robotics more accessible with end-to-end learning. State-of-the-art approaches for imitation learning and reinforcement learning with pretrained models, datasets, and simulated environments. Apache 2.0 licensed |
| Letta (ex-MemGPT) | Agentic AI & Multi-Agent SystemsAgent Memory & State | — | — | AGENT | Platform for building stateful agents with advanced memory that learn and self-improve over time |
| Letta Code | Agentic AI & Multi-Agent SystemsAutonomous Coding Agents | — | Apache-2.0 | AGENT | Memory-first coding harness designed for long-lived agents that learn from experience. Persistent agents with portable memory across models (Claude, GPT, Gemini, GLM, Kimi). CLI and desktop app for macOS, Windows, and Linux. Apache 2.0 licensed |
| LibreChat | User Interfaces & Self-hosted PlatformsLocal AI Chat UIs & Personal Assistants | — | — | TOOL | Feature-packed multi-LLM interface |
| LichtFeld-Studio | Generative Media Tools3D & Creative Tools | — | GPL-3.0 | TOOL | Native application for training, editing, and exporting 3D Gaussian Splatting scenes with MCMC optimization and timelapse generation. GPL-3.0 licensed |
| Liger Kernel | Training & Fine-tuning EcosystemLoRA / PEFT Tools | — | — | TOOL | Ultra-fast custom kernels for training speedup |
| Lighteval | Evaluation, Benchmarks & DatasetsEvaluation Frameworks | — | — | TOOL | Evaluation toolkit for LLMs across multiple backends with reusable tasks, metrics, and result tracking |
| LightGBM | Core Frameworks & LibrariesClassical ML & Gradient Boosting | — | — | TOOL | Microsoft's ultra-fast gradient boosting framework, optimized for speed and memory |
| LightLLM | Inference Engines & ServingHigh-performance Serving & API Servers | — | — | TOOL | Pure Python-based LLM inference and serving framework with lightweight design, easy extensibility, and high-speed performance. Integrates optimizations from FasterTransformer, TGI, vLLM, and SGLang |
| Lightpanda | Retrieval-Augmented Generation (RAG) & KnowledgeWeb Data Ingestion | — | — | TOOL | Machine-first headless browser in Zig; rendering-free and ultra-lightweight for AI agent browsing |
| LightRAG | Retrieval-Augmented Generation (RAG) & KnowledgeRAG Frameworks & Advanced Retrieval Tools | — | — | TOOL | Graph-based RAG with dual-level retrieval system. Simple and fast with comprehensive knowledge discovery (EMNLP 2025) |
| linfa | Core Frameworks & LibrariesRust ML Frameworks | Rust | — | TOOL | Comprehensive Rust ML toolkit with classical algorithms. scikit-learn equivalent for Rust with clustering, regression, and preprocessing |
| LiteLLM | MLOps / LLMOps & ProductionMonitoring, Evaluation & Observability | — | — | TOOL | AI Gateway to call 100+ LLM APIs in OpenAI format with unified cost tracking, guardrails, load balancing, and logging |
| LiteRT-LM | Inference Engines & ServingLocal / On-device Inference | — | Apache-2.0 | TOOL | Google's production-ready inference framework for deploying LLMs on edge devices. Cross-platform support for Android, iOS, Web, Desktop, and IoT with GPU/NPU acceleration. Powers on-device GenAI in Chrome and Chromebook Plus. Apache 2.0 licensed |
| LitGPT | Training & Fine-tuning EcosystemFull Training Frameworks | — | — | TOOL | Clean from-scratch implementations of 20+ LLMs |
| LitServe (Lightning AI) | Inference Engines & ServingHigh-performance Serving & API Servers | — | Apache-2.0 | TOOL | Minimal Python framework for building custom AI inference servers with full control over logic, batching, and scaling. 2x faster than FastAPI with built-in batching, streaming, and multi-GPU autoscaling. Apache 2.0 licensed |
| LiveBench | Evaluation, Benchmarks & DatasetsBenchmark Suites | — | — | BENCHMARK | Contamination-free LLM benchmark with objective ground-truth scoring. ICLR 2025 spotlight paper featuring frequently-updated questions from recent sources. Tests math, coding, reasoning, language, instruction following, and data analysis |
| LiveKit Agents | User Interfaces & Self-hosted PlatformsAgent & Voice Infrastructure | — | Apache-2.0 | TOOL | Framework for building real-time voice AI agents with WebRTC transport, STT-LLM-TTS pipelines, and production-grade orchestration. Used by Salesforce Agentforce and Tesla. Apache 2.0 licensed |
| Llama 4 (Meta) | Open Foundation ModelsLarge Language Models (Base + Chat) | — | — | MODEL | Meta's first natively multimodal MoE open-weight models (Scout: 10M context, Maverick: 400B+ params). Released April 2025 with enterprise-grade capabilities |
| llama-cpp-python | Inference Engines & ServingLocal / On-device Inference | — | — | TOOL | Python bindings for llama.cpp |
| LLaMA-Factory | Training & Fine-tuning EcosystemFull Training Frameworks | — | — | TOOL | One-stop unified framework for SFT, DPO, ORPO, KTO with web UI |
| llama-swap | Inference Engines & ServingAdditional Inference Engines | — | MIT | TOOL | Intelligent model swapping proxy for llama.cpp. Enables seamless hot-swapping between different GGUF models without restarting the server, with automatic model loading/unloading and OpenAI-compatible API. MIT licensed |
| llama.cpp | Inference Engines & ServingLocal / On-device Inference | — | — | TOOL | Pure C/C++ inference engine with GGUF format support. The gold standard for CPU/GPU/Apple Silicon on-device running. Includes llama-server for OpenAI-compatible API. Now at 100K+ stars |
| llama.vim | Developer Tools & IntegrationsIDE Plugins & Extensions | — | — | TOOL | Local LLM-powered code completion plugin for Vim/Neovim using llama.cpp. Fast, privacy-first, no API key needed |
| llamafile | Inference Engines & ServingHigh-performance Serving & API Servers | — | — | TOOL | Mozilla's single-file distributable LLM solution. Bundle model weights, inference engine, and runtime into one portable executable that runs on six OSes without installation |
| LlamaIndex | Retrieval-Augmented Generation (RAG) & KnowledgeRAG Frameworks & Advanced Retrieval Tools | — | — | TOOL | Full-featured RAG pipeline with advanced indexing |
| LLM (Simon Willison) | Developer Tools & IntegrationsCLI Tools & API Clients | — | Apache-2.0 | TOOL | CLI tool and Python library for interacting with dozens of LLMs via remote APIs or locally. Extensible plugin ecosystem, SQLite logging. Apache 2.0 licensed |
| LLM Compressor (vLLM) | Training & Fine-tuning EcosystemModel Quantization & Optimization | — | — | TOOL | Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM. Supports GPTQ, AWQ, SmoothQuant, AutoRound, and FP8/INT8 quantization with seamless Hugging Face integration |
| LLM Course (Maxime Labonne) | Resources & LearningCourses & Interactive Playgrounds | — | — | TOOL | End-to-end course for getting into Large Language Models with roadmaps and Colab notebooks. Covers pre-training, fine-tuning, RLHF, quantization, and prompt engineering |
| LLM Foundry | Training & Fine-tuning EcosystemFull Training Frameworks | — | — | TOOL | Databricks' training framework for composable LLM training with StreamingDataset and Composer |
| LLM Guard | MLOps / LLMOps & ProductionGuardrails & Safety Tools | — | MIT | TOOL | Comprehensive security toolkit for LLM interactions with input/output scanners for prompt injection, PII anonymization, toxic content, secrets detection, and adversarial attack prevention. MIT licensed |
| llm-d | Inference Engines & ServingHigh-performance Serving & API Servers | — | — | TOOL | Kubernetes-native distributed LLM inference framework. Donated to CNCF by Red Hat, Google, and IBM. Intelligent scheduling, KV-cache optimization, and state-of-the-art performance across accelerators |
| llmware | Retrieval-Augmented Generation (RAG) & KnowledgeRAG Frameworks & Advanced Retrieval Tools | — | Apache-2.0 | TOOL | Unified framework for building enterprise RAG pipelines with small, specialized models. Optimized for AI PC and local deployment with 300+ models in catalog. Apache 2.0 licensed |
| LM Format Enforcer | Agentic AI & Multi-Agent SystemsPrompt Engineering & Structured Outputs | — | MIT | AGENT | Enforces the output format (JSON Schema, regex, etc.) of language models by filtering allowed tokens at each generation step. Compatible with Hugging Face, llama-cpp-python, and vLLM. MIT licensed |
| lm-evaluation-harness (EleutherAI) | Evaluation, Benchmarks & DatasetsBenchmark Suites | — | — | BENCHMARK | De-facto standard for generative model evaluation |
| LMCache | Inference Engines & ServingHigh-performance Serving & API Servers | — | Apache-2.0 | TOOL | Supercharge LLM inference with the fastest KV Cache layer. 3-10x delay savings and GPU cycle reduction for multi-round QA and RAG. Integrates seamlessly with vLLM for distributed, high-throughput deployments. Apache 2.0 licensed |
| LMDeploy | Inference Engines & ServingHigh-performance Serving & API Servers | — | — | TOOL | Toolkit for compressing, deploying, and serving LLMs from OpenMMLab. 4-bit inference with 2.4x higher performance than FP16, distributed multi-model serving across machines |
| LMFlow | Training & Fine-tuning EcosystemFull Training Frameworks | — | Apache-2.0 | TOOL | Extensible toolkit for finetuning and inference of large foundation models. Features RAFT alignment algorithm and comprehensive model support. Apache 2.0 licensed |
| LMMs-Eval | Evaluation, Benchmarks & DatasetsEvaluation Frameworks | — | — | TOOL | Unified multimodal evaluation toolkit for text, image, video, and audio tasks with 100+ supported benchmarks |
| LobeChat | User Interfaces & Self-hosted PlatformsLocal AI Chat UIs & Personal Assistants | — | — | TOOL | Sleek modern chat UI |
| LocalAI | User Interfaces & Self-hosted PlatformsFull Self-hosted AI Platforms | — | MIT | TOOL | Open-source AI engine running LLMs, vision, voice, image, and video models on any hardware. Self-hosted OpenAI-compatible API. MIT licensed |
| localGPT | Retrieval-Augmented Generation (RAG) & KnowledgeRAG Frameworks & Advanced Retrieval Tools | — | — | TOOL | Local document-chat project for private, on-device Q&A over files without sending data to external APIs |
| LTX-Video (Lightricks) | Generative Media ToolsVideo Generation | — | — | TOOL | Fast native 4K video generation |
| Ludwig | Training & Fine-tuning EcosystemFull Training Frameworks | — | Apache-2.0 | TOOL | Low-code framework for building custom LLMs and deep neural networks. Declarative YAML configuration for training state-of-the-art models with PEFT/LoRA, 4-bit quantization, distributed training via Hugging Face Accelerate, and native Kubernetes support. Linux Foundation AI project. Apache 2.0 licensed |
| Luigi | Core Frameworks & LibrariesData Processing & Manipulation | — | Apache-2.0 | TOOL | Python module for building complex pipelines of batch jobs. Handles dependency resolution, workflow management, visualization, and Hadoop integration. Built at Spotify and battle-tested in production. Apache 2.0 licensed |
| Made With ML (Goku Mohandas) | Resources & LearningCourses & Interactive Playgrounds | — | — | TOOL | End-to-end course on building production-grade ML systems with MLOps fundamentals, from design to deployment and iteration |
| Mage.ai | Core Frameworks & LibrariesData Processing & Manipulation | — | Apache-2.0 | TOOL | Modern open-source data pipeline tool for integrating and transforming data. AI-native ETL/ELT platform with 100+ integrations, real-time monitoring, and collaborative features. Apache 2.0 licensed |
| Magma (Microsoft) | Open Foundation ModelsMultimodal Models (Vision + Language) | — | — | MODEL | Foundation model for multimodal AI agents that perceives the world and takes goal-driven actions across digital and physical environments. CVPR 2025 |
| Maid | User Interfaces & Self-hosted PlatformsDesktop & Mobile AI Apps | — | MIT | TOOL | Free and open-source Android app for interfacing with llama.cpp models locally and remote APIs (Anthropic, DeepSeek, Mistral, Ollama, OpenAI). MIT licensed |
| Mamba (State Space Models) | Open Foundation ModelsLarge Language Models (Base + Chat) | — | Apache-2.0 | MODEL | Novel State Space Model architecture with linear-time inference and transformer-level performance. 100% attention-free with constant memory usage, enabling efficient long-sequence modeling. Pretrained models from 130M to 2.8B parameters trained on 300B-600B tokens. Apache 2.0 licensed |
| Manticore Search | Retrieval-Augmented Generation (RAG) & KnowledgeVector Databases & Search Engines | — | — | TOOL | Fast, easy-to-use open-source search database. An alternative to Elasticsearch with a SQL-like interface and vector search capabilities |
| Marimo | Core Frameworks & LibrariesInteractive ML Apps & Notebooks | — | — | TOOL | A reactive notebook for Python — run reproducible experiments, query with SQL, execute as a script, deploy as an app, and version with git. Stored as pure Python. All in a modern, AI-native editor |
| Marker | Retrieval-Augmented Generation (RAG) & KnowledgeRAG Frameworks & Advanced Retrieval Tools | — | — | TOOL | Fast, accurate PDF-to-markdown converter with table extraction, equation handling, and optional LLM enhancement for RAG pipelines |
| MarkItDown (Microsoft) | Retrieval-Augmented Generation (RAG) & KnowledgeDocument Conversion & Preprocessing | — | MIT | TOOL | Python tool for converting files and office documents to Markdown. Supports PDF, PowerPoint, Word, Excel, images, audio, HTML, and more with OCR and transcription capabilities. MIT licensed |
| Marqo | Retrieval-Augmented Generation (RAG) & KnowledgeVector Databases & Search Engines | — | Apache-2.0 | TOOL | Multimodal vector search for text, image, and structured data. End-to-end indexing and search with built-in embedding models. Apache 2.0 licensed |
| Marquez | MLOps / LLMOps & ProductionExperiment Tracking & Versioning | — | Apache-2.0 | TOOL | LF AI & Data Foundation Graduated project for metadata collection, aggregation, and visualization. Maintains provenance of how datasets are consumed and produced with global visibility into job runtime and dataset lifecycle management. Integrates with OpenLineage. Apache 2.0 licensed |
| Marvin | Agentic AI & Multi-Agent SystemsSingle-Agent Frameworks | — | Apache-2.0 | AGENT | Python framework for structured outputs and agentic AI workflows. Simplifies LLM interactions with type-safe interfaces, automatic schema generation, and built-in observability. From the creators of Prefect. Apache 2.0 licensed |
| Mastra | Agentic AI & Multi-Agent SystemsMulti-Agent Orchestration | — | — | AGENT | TypeScript-first agent framework with built-in RAG, workflows, tool integrations, observability and observational memory |
| MaxKB | Retrieval-Augmented Generation (RAG) & KnowledgeRAG Frameworks & Advanced Retrieval Tools | — | — | TOOL | Self-hostable knowledge-base and agent platform for document ingestion, RAG pipelines, and enterprise assistant workflows |
| mcp-agent | Agentic AI & Multi-Agent SystemsMulti-Agent Orchestration | — | MIT | AGENT | Build effective agents using Model Context Protocol and simple workflow patterns. Handles connection mechanics, LLM integration, and persistent state for production MCP-based agents. MIT licensed |
| MediaPipe | Specialized DomainsComputer Vision | — | — | TOOL | Cross-platform multimodal pipelines |
| Megatron-LM | Training & Fine-tuning EcosystemDistributed Training | — | — | TOOL | Distributed training framework and reference codebase for large transformer models at scale |
| Meilisearch | Retrieval-Augmented Generation (RAG) & KnowledgeVector Databases & Search Engines | — | MIT | TOOL | Lightning-fast search engine API with AI-powered hybrid search. Features typo-tolerant full-text search combined with HNSW-based vector search for semantic retrieval. MIT licensed |
| Mem0 | Agentic AI & Multi-Agent SystemsAgent Memory & State | — | — | AGENT | Universal memory layer for AI agents. Persistent, multi-session memory across models and environments |
| MergeKit | Training & Fine-tuning EcosystemLoRA / PEFT Tools | — | — | TOOL | Advanced model merging tools |
| Metaflow | MLOps / LLMOps & ProductionDeployment & Orchestration | — | Apache-2.0 | TOOL | Netflix's ML platform for building and managing real-world AI systems. Powers thousands of projects at Netflix, Amazon, and DoorDash. Apache 2.0 licensed |
| MetaGPT | Agentic AI & Multi-Agent SystemsMulti-Agent Orchestration | — | MIT | AGENT | The Multi-Agent Framework: First AI Software Company. Assigns different roles to GPTs to form a collaborative software entity. Takes one-line requirements and outputs comprehensive software development artifacts including user stories, competitive analysis, requirements, data structures, APIs, and documents. ICLR 2024 oral presentation (top 1.2%). MIT licensed |
| Microsoft Agent Framework | Agentic AI & Multi-Agent SystemsMulti-Agent Orchestration | — | — | AGENT | Microsoft's official framework combining AutoGen's agent abstractions with Semantic Kernel's enterprise features. Supports Python and .NET with graph-based workflows |
| Microsoft BitNet | Inference Engines & ServingHigh-performance Serving & API Servers | — | MIT | TOOL | Official inference framework for 1-bit LLMs (BitNet b1.58). Enables running large models on CPU with minimal memory footprint. Features custom kernels for ternary weight quantization and efficient matmul operations. MIT licensed |
| Microsoft PromptFlow | MLOps / LLMOps & ProductionMonitoring, Evaluation & Observability | — | MIT | TOOL | Comprehensive suite for LLM-based AI app development from prototyping to production. Includes prompt engineering, evaluation, and deployment tools with VS Code integration. MIT licensed |
| Milvus | Retrieval-Augmented Generation (RAG) & KnowledgeVector Databases & Search Engines | — | — | TOOL | Scalable cloud-native vector database |
| MiMo-V2-Flash (Xiaomi) | Open Foundation ModelsLarge Language Models (Base + Chat) | — | Apache-2.0 | MODEL | 309B MoE model (15B active) with hybrid attention and Multi-Token Prediction for efficient high-speed reasoning. Apache 2.0 licensed |
| MinerU | Retrieval-Augmented Generation (RAG) & KnowledgeRAG Frameworks & Advanced Retrieval Tools | — | — | TOOL | High-accuracy document parsing for LLM and RAG workflows. Converts PDFs, Word, PPTs, and images into structured Markdown/JSON with VLM+OCR dual engine |
| mini-sglang | Inference Engines & ServingAdditional Inference Engines | — | MIT | TOOL | Compact implementation of SGLang designed to demystify modern LLM serving systems. Educational yet production-quality with RadixAttention, continuous batching, and speculative decoding. MIT licensed |
| mini-SWE-agent | Agentic AI & Multi-Agent SystemsAutonomous Coding Agents | — | — | AGENT | Lightweight coding agent for repository and issue-fixing workflows, designed for simple agentic software engineering experiments |
| MiniCPM-o 2.6 | Open Foundation ModelsMultimodal Models (Vision + Language) | — | Apache-2.0 | MODEL | Gemini 2.5 Flash level MLLM for vision, speech, and full-duplex multimodal live streaming on your phone. Apache 2.0 licensed |
| MiniCPM-V (OpenBMB) | Open Foundation ModelsAdditional Vision-Language Models | — | Apache-2.0 | MODEL | GPT-4V level multimodal LLM for single image, multi-image and high-FPS video understanding on edge devices. 8B parameters with superior OCR and reasoning capabilities. Apache 2.0 licensed |
| MiniMind | Training & Fine-tuning EcosystemFull Training Frameworks | — | Apache-2.0 | TOOL | Train a 64M-parameter LLM from scratch in just 2 hours for $3. Complete from-scratch implementation covering MoE, data cleaning, pretraining, SFT, LoRA, RLHF (DPO/PPO/GRPO), tool use, and model distillation. All core algorithms implemented in pure PyTorch without high-level abstractions. Educational framework for understanding LLM internals. Apache 2.0 licensed |
| Minuet AI | Developer Tools & IntegrationsIDE Plugins & Extensions | — | GPL-3.0 | TOOL | Neovim plugin offering as-you-type code completion from popular LLMs including OpenAI, Gemini, Claude, Ollama, Llama.cpp, Codestral, and more. GPL-3.0 licensed |
| Mirascope | Developer Tools & IntegrationsCLI Tools & API Clients | — | MIT | TOOL | Python toolkit for building LLM applications with automatic versioning, tracing, and cost tracking. The "LLM Anti-Framework" for developers who want control. MIT licensed |
| Mistral-Vibe (Mistral) | Agentic AI & Multi-Agent SystemsAutonomous Coding Agents | — | — | AGENT | Minimal CLI coding agent by Mistral. Lightweight, fast, and designed for local development workflows |
| mistral.rs | Inference Engines & ServingHigh-performance Serving & API Servers | — | — | TOOL | Fast, flexible Rust-native LLM inference engine built on Candle. Supports text, vision, audio, image generation, and embeddings with hardware-aware auto-tuning |
| ML For Beginners (Microsoft) | Resources & LearningCourses & Interactive Playgrounds | — | — | TOOL | 12-week, 26-lesson, 52-quiz classic machine learning course for beginners. Comprehensive curriculum covering regression, classification, clustering, and NLP with practical projects |
| MLC-LLM | Inference Engines & ServingLocal / On-device Inference | — | — | TOOL | Deployment engine that compiles and runs LLMs across browsers, mobile devices, and local hardware |
| MLE-bench (OpenAI) | Evaluation, Benchmarks & DatasetsBenchmark Suites | — | MIT | BENCHMARK | Benchmark for measuring how well AI agents perform at machine learning engineering. Evaluates agents on 75 Kaggle competitions covering diverse ML tasks. MIT licensed |
| MLflow | MLOps / LLMOps & ProductionExperiment Tracking & Versioning | — | — | TOOL | End-to-end open platform for the ML/LLM lifecycle |
| MLForecast | Core Frameworks & LibrariesClassical ML & Gradient Boosting | — | Apache-2.0 | TOOL | Scalable machine learning for time series forecasting. Train any sklearn-compatible model on millions of time series with efficient feature engineering. Apache 2.0 licensed |
| MLJ.jl | Core Frameworks & LibrariesJulia ML Frameworks | Julia | MIT | TOOL | Comprehensive Julia machine learning framework providing a unified interface to 200+ models with meta-algorithms for selection, tuning, and evaluation. MIT licensed |
| mllm | Inference Engines & ServingAdditional Inference Engines | — | MIT | TOOL | Fast and lightweight multimodal LLM inference engine for mobile and edge devices. Optimized for running vision-language models on resource-constrained hardware with efficient memory management. MIT licensed |
| MLPerf Inference | Evaluation, Benchmarks & DatasetsBenchmark Suites | — | — | BENCHMARK | Industry-standard ML inference benchmarks with reference implementations for AI accelerators |
| MLPerf Training | Evaluation, Benchmarks & DatasetsBenchmark Suites | — | Apache-2.0 | BENCHMARK | Industry-standard ML training benchmarks from MLCommons. Reference implementations for training AI models at scale across image classification, object detection, NLP, and recommendation tasks. Apache 2.0 licensed |
| MLRun | MLOps / LLMOps & ProductionDeployment & Orchestration | — | Apache-2.0 | TOOL | Open-source AI orchestration platform for quickly building and managing continuous ML and generative AI applications across their lifecycle. Automates data preparation, model tuning, and deployment. Apache 2.0 licensed |
| MLX | Core Frameworks & LibrariesDeep Learning Frameworks | — | MIT | TOOL | Array framework for machine learning on Apple silicon. Efficient unified memory design with NumPy-like API, automatic differentiation, and multi-device support. MIT licensed |
| MMaDA (Gen-Verse) | Open Foundation ModelsMultimodal Models (Vision + Language) | — | MIT | MODEL | Open-sourced multimodal large diffusion language model with unified architecture for text, image generation and multimodal reasoning. MIT licensed, NeurIPS 2025 |
| MNN | Specialized DomainsEdge / On-device AI | — | Apache-2.0 | TOOL | Blazing-fast, lightweight inference engine battle-tested by Alibaba. Supports inference and training with industry-leading on-device performance. Powers high-performance LLMs and Edge AI with MNN-LLM runtime. Apache 2.0 licensed |
| MobileAgent (Alibaba/X-PLUG) | Agentic AI & Multi-Agent SystemsDomain-Specific Agents | — | MIT | AGENT | Powerful GUI agent family for autonomous mobile device control. Multimodal agent framework designed to operate smartphone apps through visual UI perception and reasoning. MIT licensed |
| Mochi 1 (Genmo) | Open Foundation ModelsVideo & Animation Models | — | — | MODEL | 10B open video model with impressive motion and consistency |
| ModelingToolkit.jl | Core Frameworks & LibrariesJulia ML Frameworks | Julia | MIT | TOOL | High-performance symbolic-numeric modeling framework for scientific machine learning. Automatically generates fast functions for model components like Jacobians and Hessians with automatic sparsification and parallelization. MIT licensed |
| ModelScope | MLOps / LLMOps & ProductionModel Hubs & Registries | — | Apache-2.0 | TOOL | Model-as-a-Service platform bringing together 700+ state-of-the-art ML models from the AI community. Covers NLP, CV, Audio, Multi-modality, and AI for Science with streamlined model inference, fine-tuning and evaluation. Apache 2.0 licensed |
| Modin | Core Frameworks & LibrariesData Processing & Manipulation | — | — | TOOL | Parallel Pandas DataFrames. Scale Pandas workflows by changing a single line of code; data and computation are distributed automatically |
| MONAI | Specialized DomainsMedical Imaging & Healthcare AI | — | Apache-2.0 | TOOL | Medical Open Network for AI. End-to-end framework for healthcare imaging with state-of-the-art, production-ready training workflows. Apache 2.0 licensed |
| Mooncake | Inference Engines & ServingAdditional Inference Engines | — | Apache-2.0 | TOOL | Production-grade serving platform for Kimi (Moonshot AI). Features distributed KV cache pool with intelligent offloading, prefill/decode disaggregation, and cross-instance KV reuse. Integrated with vLLM, SGLang, and TensorRT-LLM. Apache 2.0 licensed |
| Moondream (m87-labs) | Open Foundation ModelsMultimodal Models (Vision + Language) | — | Apache-2.0 | MODEL | Tiny vision language model (0.5B and 2B parameters) that runs anywhere. Powerful image understanding with remarkably small footprint for edge devices and real-time applications. Apache 2.0 licensed |
| Morphic | User Interfaces & Self-hosted PlatformsLocal AI Chat UIs & Personal Assistants | — | Apache-2.0 | TOOL | AI-powered search engine with a generative UI. Supports multiple AI providers (OpenAI, Anthropic, Google, Ollama) and search providers (Tavily, SearXNG, Brave). Features smart search modes, widgets, and image/video search. Apache 2.0 licensed |
| Morphik | Retrieval-Augmented Generation (RAG) & KnowledgeRAG Frameworks & Advanced Retrieval Tools | — | MIT | TOOL | Open-source multimodal RAG framework for building AI apps over private knowledge. Handles text, images, and documents with built-in embedding generation and vector search. MIT licensed |
| MoveIt 2 | Specialized Domains3D Vision & Point Cloud Processing | — | BSD-3-Clause | TOOL | Open source robotics manipulation framework for ROS 2. Motion planning, manipulation, 3D perception, kinematics, control, and navigation for robotic arms. BSD-3-Clause licensed |
| ms-swift | Training & Fine-tuning EcosystemFull Training Frameworks | — | — | TOOL | Unified training framework for 600+ LLMs and 300+ MLLMs with CPT/SFT/DPO/GRPO (AAAI 2025) |
| MTEB | Retrieval-Augmented Generation (RAG) & KnowledgeEmbedding Benchmarks | — | — | TOOL | Massive Text Embedding Benchmark covering 1000+ languages and diverse tasks. The industry standard for evaluating and comparing embedding models |
| MuJoCo | Specialized DomainsReinforcement Learning & Robotics | — | Apache-2.0 | TOOL | General-purpose physics simulator for robotics, biomechanics, and ML research. High-fidelity contact dynamics with native Python and C++ bindings. Apache 2.0 licensed |
| MusicGen / AudioCraft (Meta) | Open Foundation ModelsSpeech & Audio Models (TTS, STT, Music) | — | — | MODEL | Open music and audio generation models |
| n8n | Agentic AI & Multi-Agent SystemsDomain-Specific Agents | — | — | AGENT | Self-hostable workflow automation platform with AI agent nodes, tool integrations, and production automation workflows |
| nano-vLLM | Inference Engines & ServingHigh-performance Serving & API Servers | — | MIT | TOOL | Minimalist vLLM implementation in ~1,200 lines of Python. Educational yet performant with prefix caching, tensor parallelism, and CUDA graph acceleration. Comparable inference speeds to full vLLM. MIT licensed |
| Nanocoder (Nano-Collective) | Agentic AI & Multi-Agent SystemsAutonomous Coding Agents | — | — | AGENT | Beautiful local-first coding agent running in your terminal. Built for privacy and control with support for multiple AI providers via OpenRouter |
| nanoflann | Retrieval-Augmented Generation (RAG) & KnowledgeVector Databases & Search Engines | — | BSD | TOOL | C++11 header-only library for fast nearest neighbor search with KD-trees. Zero dependencies, single-file integration, and 2-3x faster than FLANN with modern C++. BSD licensed |
| nanoGPT (Andrej Karpathy) | Training & Fine-tuning EcosystemFull Training Frameworks | — | MIT | TOOL | The simplest, fastest repository for training/finetuning medium-sized GPTs. Clean, minimal, and hackable codebase for understanding transformer training from scratch. MIT licensed |
| Nanotron (Hugging Face) | Training & Fine-tuning EcosystemDistributed Training | — | — | TOOL | Minimalistic 3D-parallelism LLM pretraining with tensor, pipeline, and data parallelism. Designed for simplicity and speed |
| Narwhals | Core Frameworks & LibrariesData Processing & Manipulation | — | MIT | TOOL | Lightweight compatibility layer between DataFrame libraries. Write Polars-like code that works seamlessly across Pandas, Polars, cuDF, Modin, and more. MIT licensed |
| NASA Astrobee | Specialized DomainsAutonomous Driving & Robotics Simulators | — | Apache-2.0 | TOOL | NASA's free-flying robot software for the International Space Station. Flight software for vision-based localization, autonomous navigation, docking, and human-robot interaction. NASA Software of the Year Award Runner-Up 2020. Apache 2.0 licensed |
| NCNN | Specialized DomainsEdge / On-device AI | — | BSD-3-Clause | TOOL | High-performance neural network inference framework optimized for mobile platforms. No third-party dependencies, cross-platform, and runs faster than all known open-source frameworks on mobile CPU. Powers Tencent apps including QQ, WeChat, and Pitu. BSD-3-Clause licensed |
| NeMo Guardrails (NVIDIA) | AI Safety, Alignment & InterpretabilityAdversarial & Red-teaming Tools | — | Apache-2.0 | TOOL | Programmable guardrails toolkit for LLM-based conversational systems. Uses Colang DSL to define safety rules, dialog flows, and content boundaries. Integrates with LangChain, LangGraph, and LlamaIndex for production deployments. Apache 2.0 licensed |
| NeMo-RL | Training & Fine-tuning EcosystemFull Training Frameworks | — | — | TOOL | Scalable toolkit for efficient model reinforcement with DTensor and Megatron backends |
| Nemotron (NVIDIA) | Open Foundation ModelsLarge Language Models (Base + Chat) | — | Apache-2.0 | MODEL | Open and efficient models for agentic AI with training recipes, deployment guides, and use-case examples. Apache 2.0 licensed |
| Netflix Maestro | MLOps / LLMOps & ProductionDeployment & Orchestration | — | Apache-2.0 | TOOL | Netflix's next-generation workflow orchestrator for data and ML pipelines at massive scale. Highly scalable and flexible scheduler designed to handle millions of workflows across thousands of nodes. Apache 2.0 licensed |
| NetworkX | Core Frameworks & LibrariesData Processing & Manipulation | — | — | TOOL | Creation, manipulation, and study of complex networks. The foundational graph analysis library for Python data science |
| Neurite | Retrieval-Augmented Generation (RAG) & KnowledgeRAG Frameworks & Advanced Retrieval Tools | — | MIT | TOOL | Fractal Graph-of-Thought mind-mapping for AI agents, web-links, notes, and code. Rhizomatic workspace blending chaos theory, graph theory, and fractal logic for creative thinking and RAG workflows. MIT licensed |
| Neuron AI | Agentic AI & Multi-Agent SystemsSingle-Agent Frameworks | — | MIT | AGENT | PHP agentic framework for building production-ready AI-driven applications. Connect components (LLMs, vector DBs, memory) to agents that can interact with your data. MIT licensed |
| Newelle | User Interfaces & Self-hosted PlatformsLocal AI Chat UIs & Personal Assistants | — | — | TOOL | GNOME/Linux desktop virtual assistant with integrated file editor, global hotkeys, and profile manager |
| NextChat | User Interfaces & Self-hosted PlatformsLocal AI Chat UIs & Personal Assistants | — | MIT | TOOL | Light and fast AI assistant supporting Web, iOS, macOS, Android, Linux, and Windows. One-click deploy with multi-model support. MIT licensed |
| Nezha | Developer Tools & IntegrationsAI-Native IDEs & Development Environments | — | GPL-3.0 | TOOL | Code editor for the AI agents era. Run multiple Claude Code and Codex agents across projects on your machine with an intuitive interface. GPL-3.0 licensed |
| Nimbalyst | Developer Tools & IntegrationsAI-Native IDEs & Development Environments | — | MIT | TOOL | Desktop app for running multiple Codex and Claude Code AI sessions in parallel Git worktrees. Test, compare approaches and manage AI-assisted development workflows in one unified interface. MIT licensed |
| NLP Course (Yandex Data School) | Resources & LearningCourses & Interactive Playgrounds | — | MIT | TOOL | YSDA course in Natural Language Processing with 2025 materials covering text classification, language models, transformers, and modern NLP techniques. MIT licensed |
| NMSLIB | Retrieval-Augmented Generation (RAG) & KnowledgeVector Databases & Search Engines | — | Apache-2.0 | TOOL | Non-Metric Space Library for efficient similarity search in generic non-metric spaces. Comprehensive toolkit for evaluating k-NN methods with support for exotic distance functions. Apache 2.0 licensed |
| nnU-Net | Specialized DomainsMedical Imaging & Healthcare AI | — | Apache-2.0 | TOOL | Self-configuring deep learning method for medical image segmentation. Automatically adapts to any dataset without manual parameter tuning. Widely adopted as the standard baseline for biomedical segmentation challenges. Apache 2.0 licensed |
| NumPy | Core Frameworks & LibrariesData Processing & Manipulation | — | — | TOOL | Fundamental array computing library that powers almost every AI stack |
| NumPyro | Core Frameworks & LibrariesDeep Learning Frameworks | — | — | TOOL | Probabilistic programming with NumPy powered by JAX for autograd and JIT compilation. Bayesian modeling and inference at scale |
| NVIDIA Apex | Core Frameworks & LibrariesModel Training & Optimization Utilities | — | BSD-3-Clause | TOOL | PyTorch extension for mixed precision training and distributed training optimizations. Powers many production deep learning workloads with tools for automatic mixed precision (AMP), distributed data parallel, and fused optimizers. BSD-3-Clause licensed |
| NVIDIA DALI | Core Frameworks & LibrariesData Processing & Manipulation | — | Apache-2.0 | TOOL | GPU-accelerated data loading and augmentation library with highly optimized building blocks for deep learning applications. Apache 2.0 licensed |
| NVIDIA DeepOps | MLOps / LLMOps & ProductionDeployment & Orchestration | — | BSD-3-Clause | TOOL | Infrastructure automation tools for building GPU clusters with Kubernetes and Slurm. Deploys multi-node GPU clusters with monitoring, logging, and storage for AI/HPC workloads. BSD-3-Clause licensed |
| NVIDIA Dynamo | Inference Engines & ServingHigh-performance Serving & API Servers | — | Apache-2.0 | TOOL | Datacenter-scale distributed inference serving framework from NVIDIA. Orchestration layer above vLLM/SGLang/TensorRT-LLM with disaggregated serving, KV-aware routing, and automatic scaling. Built in Rust with Python extensibility. Apache 2.0 licensed |
| NVIDIA KAI Scheduler | MLOps / LLMOps & ProductionDeployment & Orchestration | — | Apache-2.0 | TOOL | Kubernetes-native GPU scheduler for AI workloads at large scale. Originally developed by Run:ai, now open-sourced by NVIDIA. Optimizes GPU resource allocation with dynamic allocation and efficient queue management. Apache 2.0 licensed |
| NVIDIA Model Optimizer | Training & Fine-tuning EcosystemModel Quantization & Optimization | — | — | TOOL | Unified library of SOTA model optimization techniques including quantization, pruning, distillation, and speculative decoding. Compresses deep learning models for deployment with TensorRT-LLM, TensorRT, and vLLM to optimize inference speed across NVIDIA hardware |
| NVIDIA Modulus | Specialized DomainsScientific AI & Physics ML | — | Apache-2.0 | TOOL | Open-source deep learning framework for physics-informed machine learning (Physics-ML). Build, train, and fine-tune models for AI4science and engineering applications using state-of-the-art SciML methods. Apache 2.0 licensed |
| NVIDIA NeMo Speech | Open Foundation ModelsSpeech & Audio Models (TTS, STT, Music) | — | Apache-2.0 | MODEL | Scalable generative AI framework for Speech AI including ASR, TTS, and speech LLMs. Includes state-of-the-art Canary and Parakeet models with 25+ European language support. Apache 2.0 licensed |
| NVTabular | MLOps / LLMOps & ProductionFeature Engineering & Data Preparation | — | Apache-2.0 | TOOL | GPU-accelerated feature engineering and preprocessing library for tabular data. Manipulates terabyte-scale datasets to train deep learning recommender systems. Component of NVIDIA Merlin framework. Apache 2.0 licensed |
| Ollama | Inference Engines & ServingLocal / On-device Inference | — | — | TOOL | Dead-simple local LLM runner with a one-line install, model registry, and OpenAI-compatible API |
| OLMo 2 (Allen AI) | Open Foundation ModelsLarge Language Models (Base + Chat) | — | — | MODEL | Fully open-source LLMs (1B–32B) with complete transparency: models, data, training code, and logs. Designed by scientists, for scientists |
| OmniGen (VectorSpaceLab) | Open Foundation ModelsMultimodal Models (Vision + Language) | — | MIT | MODEL | Unified image generation model that handles text-to-image, subject-driven generation, identity-preserving generation, and image editing from multi-modal prompts without additional plugins. MIT licensed |
| OmniParse | Retrieval-Augmented Generation (RAG) & KnowledgeDocument Conversion & Preprocessing | — | GPL-3.0 | TOOL | Ingest and parse any unstructured data into structured, actionable data optimized for GenAI applications. Supports documents, tables, images, videos, audio, and web pages with local deployment on a T4 GPU. GPL-3.0 licensed |
| OmniParser (Microsoft) | Open Foundation ModelsMultimodal Models (Vision + Language) | — | CC-BY-4.0 | MODEL | Pure vision-based GUI agent framework that parses screen elements for AI automation. V2 achieves state-of-the-art results on the ScreenSpot-Pro benchmark. Powers computer-use agents with any vision model. CC-BY-4.0 licensed |
| OmniSVG | Open Foundation ModelsMultimodal Models (Vision + Language) | — | Apache-2.0 | MODEL | First family of end-to-end multimodal SVG generators leveraging pre-trained Vision-Language Models. Capable of generating complex SVGs from simple icons to intricate anime characters. NeurIPS 2025. Apache 2.0 licensed |
| One-API | Inference Engines & ServingHigh-performance Serving & API Servers | — | MIT | TOOL | LLM API management and key redistribution system. Unifies multiple providers (OpenAI, Anthropic, Azure, etc.) under a single OpenAI-compatible API with built-in rate limiting, quota management, and cost tracking. MIT licensed |
| oneDNN | Core Frameworks & LibrariesHigh-Performance Compute Libraries | — | Apache-2.0 | TOOL | oneAPI Deep Neural Network Library. Cross-platform performance library of basic building blocks for deep learning, optimized for Intel CPUs, GPUs, and Arm architectures. Apache 2.0 licensed |
| OneTrainer | Training & Fine-tuning EcosystemFull Training Frameworks | — | — | TOOL | One-stop solution for all your Diffusion training needs. Supports FLUX, Stable Diffusion 1.5/2.x/3.x/SDXL, Würstchen, PixArt, Hunyuan Video and more. Features full fine-tuning, LoRA, embeddings, masked training, automatic backups, and TensorBoard integration. GPL-3.0 licensed |
| Onlook | Developer Tools & IntegrationsAI-Native IDEs & Development Environments | — | — | TOOL | Open-source AI-first design and React editing environment for visually building and modifying frontend applications |
| ONNX | Core Frameworks & LibrariesHigh-Performance Compute Libraries | — | Apache-2.0 | TOOL | Open standard for machine learning interoperability. Open Neural Network Exchange provides an open ecosystem that empowers AI developers to choose the right tools as their project evolves. Apache 2.0 licensed |
| ONNX Model Zoo | MLOps / LLMOps & ProductionModel Hubs & Registries | — | Apache-2.0 | TOOL | Collection of pre-trained, state-of-the-art models in the ONNX format. 80+ models spanning vision, NLP, and audio with validation data and reference implementations. Apache 2.0 licensed |
| ONNX Runtime | Core Frameworks & LibrariesModel Training & Optimization Utilities | — | — | TOOL | High-performance inference and training for ONNX models across hardware |
| Onyx | User Interfaces & Self-hosted PlatformsFull Self-hosted AI Platforms | — | MIT | TOOL | Full-featured AI platform with Chat, RAG, Agents, and Actions. 40+ document connectors and support for any LLM. MIT licensed (Community Edition) |
| Open Interpreter | Developer Tools & IntegrationsAI Coding Assistants (open-source) | — | — | TOOL | Lets LLMs run code locally |
| Open LLM Leaderboard (Hugging Face) | Resources & LearningPapers with Open Implementations | — | — | TOOL | Real-time ranking of open models |
| Open Multi-Agent | Agentic AI & Multi-Agent SystemsMulti-Agent Orchestration | — | MIT | AGENT | TypeScript-native multi-agent orchestration with multi-model teams and parallel execution. Automatically converts goals to task DAGs. MIT licensed |
| Open Notebook | Developer Tools & IntegrationsNotebooks & Interactive Computing | — | MIT | TOOL | Open-source implementation of NotebookLM with multi-modal content support (PDFs, videos, audio, web pages). Features multi-speaker podcast generation, 18+ AI provider integrations, and full-text + vector search. Self-hosted with complete data sovereignty. MIT licensed |
| Open SWE | Agentic AI & Multi-Agent SystemsAutonomous Coding Agents | — | — | AGENT | Asynchronous coding agent from the LangChain ecosystem for background software engineering tasks |
| Open WebUI | User Interfaces & Self-hosted PlatformsLocal AI Chat UIs & Personal Assistants | — | — | TOOL | Most popular self-hosted ChatGPT-style interface |
| Open-Sora (HPC-AI Tech) | Open Foundation ModelsVideo & Animation Models | — | Apache-2.0 | MODEL | Democratizing efficient video production for all. Complete open-source video generation system with 11B model achieving commercial-level quality. Apache 2.0 licensed |
| Open-Sora-Plan (PKU-YuanGroup) | Generative Media ToolsVideo Generation | — | MIT | TOOL | Reproduction of Sora with full open-source pipeline for text-to-video generation. MIT licensed |
| Open3D | Specialized Domains3D Vision & Point Cloud Processing | — | MIT | TOOL | Modern library for 3D data processing with Python and C++ APIs. Core features include 3D data structures, processing algorithms, scene reconstruction, surface alignment, 3D visualization, and GPU acceleration. MIT licensed |
| OpenAgents | Agentic AI & Multi-Agent SystemsMulti-Agent Orchestration | — | Apache-2.0 | AGENT | AI Agent Networks for Open Collaboration. Platform for building collaborative multi-agent systems with shared knowledge and distributed task execution. Apache 2.0 licensed |
| OpenAI Agents SDK | Agentic AI & Multi-Agent SystemsMulti-Agent Orchestration | — | — | AGENT | Production-ready lightweight framework for multi-agent workflows. The evolution of Swarm with enhanced orchestration capabilities and enterprise-grade features |
| OpenAI Evals | Evaluation, Benchmarks & DatasetsEvaluation Frameworks | — | MIT | TOOL | Framework for evaluating LLMs and LLM systems with an open-source registry of 100+ community-contributed benchmarks. MIT licensed |
| OpenBB | Specialized DomainsFinance & Quantitative AI | — | AGPL-3.0 | TOOL | Financial data platform for analysts, quants and AI agents. Open-source investment research infrastructure with extensive data integrations. AGPL-3.0 licensed |
| OpenClaw | User Interfaces & Self-hosted PlatformsLocal AI Chat UIs & Personal Assistants | — | — | TOOL | Local-first personal AI assistant with multi-channel integrations and full agentic task execution |
| OpenCLIP | Open Foundation ModelsMultimodal Models (Vision + Language) | — | — | MODEL | Open source implementation of CLIP with trained models and training code. Includes state-of-the-art trained ViT-G/14 models and comprehensive zero-shot evaluation suite |
| OpenCode | Agentic AI & Multi-Agent SystemsAutonomous Coding Agents | — | — | AGENT | Terminal-native autonomous coding agent |
| OpenCompass | Evaluation, Benchmarks & DatasetsBenchmark Suites | — | — | BENCHMARK | Evaluation platform for benchmarking language and multimodal models across large benchmark suites |
| OpenContracts | Specialized DomainsLegal AI & Contract Analysis | — | AGPL-3.0 | TOOL | Self-hosted document annotation platform for legal AI. Semantic search, contract analysis, version control, and MCP integration for building legal knowledge bases. AGPL-3.0 licensed |
| OpenCV | Specialized DomainsComputer Vision | — | — | TOOL | World's most widely used computer vision library |
| OpenEvals | Evaluation, Benchmarks & DatasetsEvaluation Frameworks | — | MIT | TOOL | Open-source evaluation library for LLM and agent applications. Built by LangChain with pre-built evaluators for common use cases including RAG, agents, and structured output validation. MIT licensed |
| OpenFold | Specialized DomainsScientific AI & Drug Discovery | — | Apache-2.0 | TOOL | Trainable PyTorch reproduction of AlphaFold2. Complete open-source pipeline for protein structure prediction with competitive accuracy to the original. Apache 2.0 licensed |
| OpenHands (ex-OpenDevin) | Agentic AI & Multi-Agent SystemsAutonomous Coding Agents | — | — | AGENT | Full-featured open-source AI software engineer |
| OpenLineage | MLOps / LLMOps & ProductionExperiment Tracking & Versioning | — | Apache-2.0 | TOOL | Open standard for lineage metadata collection designed to instrument jobs as they run. Defines a generic model of run, job, and dataset entities for consistent data lineage tracking. Apache 2.0 licensed |
| OpenLIT | MLOps / LLMOps & ProductionMonitoring, Evaluation & Observability | — | — | TOOL | OpenTelemetry-native LLM observability platform with GPU monitoring, evaluations, prompt management, and guardrails |
| OpenLLM (BentoML) | Inference Engines & ServingHigh-performance Serving & API Servers | — | Apache-2.0 | TOOL | Production-grade platform for running any open-source LLMs as OpenAI-compatible API endpoints. Supports 50+ models with built-in streaming, batching, and auto-acceleration. Apache 2.0 licensed |
| OpenLLMetry (Traceloop) | MLOps / LLMOps & ProductionMonitoring, Evaluation & Observability | — | — | TOOL | Open-source observability for GenAI/LLM applications based on OpenTelemetry with 25+ integration backends |
| OpenManus | Agentic AI & Multi-Agent SystemsMulti-Agent Orchestration | — | MIT | AGENT | Open-source framework for building general AI agents. Modular agent architecture with planning, tool use, and autonomous task execution. 56k+ stars. MIT licensed |
| OpenMetadata | Core Frameworks & LibrariesData Engineering & Feature Stores | — | Apache-2.0 | TOOL | Unified metadata platform for data discovery, observability, and governance. Column-level lineage, semantic search, and team collaboration with 70+ data service connectors. Apache 2.0 licensed |
| OpenMLDB | MLOps / LLMOps & ProductionFeature Engineering & Data Preparation | — | Apache-2.0 | TOOL | Open-source machine learning database providing a feature platform for consistent features between training and inference. Real-time relational data feature computation system for online ML applications. Apache 2.0 licensed |
| OpenPilot | Specialized DomainsAutonomous Driving & Robotics Simulators | — | MIT | TOOL | Operating system for robotics. Currently upgrades driver assistance systems on 300+ supported cars. End-to-end autonomous driving stack with open-source hardware and software. MIT licensed |
| OpenRefine | Core Frameworks & LibrariesData Labeling & Annotation | — | BSD-3-Clause | TOOL | Free, open-source power tool for working with messy data. Clean, transform, and extend data with web services. Formerly Google Refine. BSD-3-Clause licensed |
| OpenRLHF | Training & Fine-tuning EcosystemFull Training Frameworks | — | Apache-2.0 | TOOL | Easy-to-use, scalable RLHF framework based on Ray. Supports PPO, GRPO, REINFORCE++, DAPO with vLLM integration and async training. Apache 2.0 licensed |
| OpenSearch | Retrieval-Augmented Generation (RAG) & KnowledgeVector Databases & Search Engines | — | — | TOOL | Open-source distributed and RESTful search and analytics suite with native vector search. Enterprise-grade fork of Elasticsearch with k-NN plugin for semantic search at scale |
| OpenSpiel | Specialized DomainsGame AI & Simulations | — | Apache-2.0 | TOOL | Collection of environments and algorithms for research in general reinforcement learning and search/planning in games from Google DeepMind. Apache 2.0 licensed |
| OpenSplat | Generative Media Tools3D & Creative Tools | — | AGPL-3.0 | TOOL | Production-grade, portable implementation of 3D Gaussian Splatting with CPU/GPU support for Windows, Mac, and Linux. Creates 3D scenes from camera poses and sparse points. AGPL-3.0 licensed |
| OpenThoughts | Evaluation, Benchmarks & DatasetsHigh-quality Open Datasets & Data Tools | — | Apache-2.0 | TOOL | Fully open data curation for reasoning models. Curated high-quality reasoning datasets for training and evaluating LLMs. Apache 2.0 licensed |
| OpenVINO | Specialized DomainsEdge / On-device AI | — | — | TOOL | Intel's toolkit for edge deployment |
| OpenVINO Open Model Zoo | MLOps / LLMOps & ProductionModel Hubs & Registries | — | Apache-2.0 | TOOL | Pre-trained deep learning models and demos optimized for Intel hardware. 200+ public pre-trained models for vision, speech, and NLP with benchmarking tools and accuracy metrics. Apache 2.0 licensed |
| Opik (Comet) | MLOps / LLMOps & ProductionMonitoring, Evaluation & Observability | — | — | TOOL | Production-ready LLM evaluation platform |
| optillm | Inference Engines & ServingAdditional Inference Engines | — | Apache-2.0 | TOOL | Optimizing inference proxy for LLMs with load balancing, failover, and request routing across multiple providers and models. Improves reliability and performance for production deployments. Apache 2.0 licensed |
| Optimum | Inference Engines & ServingQuantization, Distillation & Optimization | — | — | TOOL | Hardware-specific acceleration and quantization |
| Optuna | Core Frameworks & LibrariesAutoML & Hyperparameter Optimization | — | — | TOOL | Modern, define-by-run hyperparameter optimization with pruning and visualizations. Widely adopted |
| Orama | Retrieval-Augmented Generation (RAG) & KnowledgeVector Databases & Search Engines | — | — | TOOL | Lightweight search engine with full-text, vector, and hybrid search for browser, server, and edge applications |
| OSWorld | Evaluation, Benchmarks & DatasetsHigh-quality Open Datasets & Data Tools | — | — | TOOL | Multimodal agent benchmark dataset |
| Oumi | Training & Fine-tuning EcosystemFull Training Frameworks | — | Apache-2.0 | TOOL | Fully open-source platform for the complete foundation model lifecycle - from data preparation and training to evaluation and deployment. Supports 100+ models with 200+ recipes for fine-tuning gpt-oss, Qwen3, DeepSeek-R1, and more. Apache 2.0 licensed |
| OuteTTS / CosyVoice 2 | Open Foundation ModelsSpeech & Audio Models (TTS, STT, Music) | — | — | MODEL | High-quality open TTS with natural prosody and multilingual support |
| Outlines | Agentic AI & Multi-Agent SystemsPrompt Engineering & Structured Outputs | — | Apache-2.0 | AGENT | Structured outputs for LLMs. Guarantees valid JSON, regex-compliant text, and Pydantic model outputs during generation. Trusted by NVIDIA, Cohere, Hugging Face, and vLLM. Apache 2.0 licensed |
| OWL (camel-ai/owl) | Agentic AI & Multi-Agent SystemsDomain-Specific Agents | — | — | AGENT | Advanced multi-agent collaboration system |
| Oxen | Core Frameworks & LibrariesData Processing & Manipulation | — | Apache-2.0 | TOOL | Lightning fast data version control for machine learning. Optimized for large datasets with efficient diffing, branching, and collaboration. Apache 2.0 licensed |
| PaddleClas | MLOps / LLMOps & ProductionModel Hubs & Registries | — | Apache-2.0 | TOOL | Comprehensive image recognition and classification toolkit with rich model zoo. 5,800+ stars featuring 24 series of classification networks, 122 pretrained models, and end-to-end image recognition systems including PP-ShiTuV2. Apache 2.0 licensed |
| PaddleNLP | Training & Fine-tuning EcosystemFull Training Frameworks | — | Apache-2.0 | TOOL | Easy-to-use and powerful LLM library built on Baidu's PaddlePaddle framework. Supports 100+ models with efficient training, compression, and high-performance inference on diverse hardware. Features RsLoRA+ algorithm, DeepSeek V3/R1 support with FP8/INT8 quantization, and unified checkpointing. Apache 2.0 licensed |
| PaddlePaddle | Core Frameworks & LibrariesDeep Learning Frameworks | — | — | TOOL | Industrial deep learning platform from Baidu serving 23+ million developers and 760,000+ companies. China's first independently developed deep learning framework, with advanced distributed training and deployment capabilities |
| PaddleSeg | MLOps / LLMOps & ProductionModel Hubs & Registries | — | Apache-2.0 | TOOL | Easy-to-use image segmentation library with awesome pre-trained model zoo. Supports semantic segmentation, interactive segmentation, panoptic segmentation, image matting, and 3D segmentation with 200+ pre-trained models. Apache 2.0 licensed |
| PageIndex (VectifyAI) | Retrieval-Augmented Generation (RAG) & KnowledgeRAG Frameworks & Advanced Retrieval Tools | — | MIT | TOOL | Vectorless, reasoning-based RAG framework using document index structure. Achieves high accuracy without vector databases through intelligent context engineering and reasoning-based retrieval. MIT licensed |
| Pandas | Core Frameworks & LibrariesData Processing & Manipulation | — | — | TOOL | The gold standard for data analysis and manipulation in Python |
| Pandera | Core Frameworks & LibrariesData Processing & Manipulation | — | MIT | TOOL | Statistical data testing and validation for dataframes. Pydantic-like API for Pandas, Polars, and other dataframe libraries with type hints and lazy validation. MIT licensed |
| Paperclip | Agentic AI & Multi-Agent SystemsMulti-Agent Orchestration | — | MIT | AGENT | AI agent company and orchestration framework with 55K+ stars. MIT licensed |
| Paperless-AI | Retrieval-Augmented Generation (RAG) & KnowledgeWeb Data Ingestion | — | — | TOOL | Automated document analyzer for Paperless-ngx with RAG-powered semantic search across your document archive |
| Papers with Code | Resources & LearningPapers with Open Implementations | — | — | TOOL | Definitive database linking papers to open code and datasets |
| ParadeDB | Retrieval-Augmented Generation (RAG) & KnowledgeVector Databases & Search Engines | — | — | TOOL | Postgres-native search and analytics engine for full-text, faceted, and hybrid retrieval without moving data out of PostgreSQL |
| Parlant | Agentic AI & Multi-Agent SystemsDomain-Specific Agents | — | Apache-2.0 | AGENT | Conversational control layer for customer-facing AI agents. Enterprise-grade context engineering framework optimized for consistent, compliant, and on-brand B2C and sensitive B2B interactions. Apache 2.0 licensed |
| Pathway | Retrieval-Augmented Generation (RAG) & KnowledgeRAG Frameworks & Advanced Retrieval Tools | — | BSL-1.1 | TOOL | Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG. Features 350+ connectors with always-in-sync data from SharePoint, Google Drive, S3, Kafka, PostgreSQL and more. BSL 1.1 licensed (becomes Apache 2.0 after 4 years) |
| Peekaboo | Developer Tools & IntegrationsIDE Plugins & Extensions | — | MIT | TOOL | macOS CLI & MCP server enabling AI agents to capture screenshots and automate UI interactions. Visual question answering through local or remote AI models. MIT licensed |
| PEFT (Parameter-Efficient Fine-Tuning) | Training & Fine-tuning EcosystemLoRA / PEFT Tools | — | — | TOOL | Hugging Face library for parameter-efficient fine-tuning methods such as LoRA, QLoRA, and DoRA |
| PentestAgent (GH05TCREW) | AI Safety, Alignment & InterpretabilityAdversarial & Red-teaming Tools | — | MIT | TOOL | AI agent framework for black-box security testing, supporting bug bounty, red-team, and penetration testing workflows. MIT licensed |
| Pezzo | MLOps / LLMOps & ProductionMonitoring, Evaluation & Observability | — | Apache-2.0 | TOOL | Cloud-native LLMOps platform with prompt management, versioning, and observability. Features collaborative prompt editing, A/B testing, and cost analytics. Apache 2.0 licensed |
| pgvector | Retrieval-Augmented Generation (RAG) & KnowledgeVector Databases & Search Engines | — | — | TOOL | PostgreSQL extension for vector similarity search |
| pgvectorscale | Retrieval-Augmented Generation (RAG) & KnowledgeVector Databases & Search Engines | — | PostgreSQL | TOOL | PostgreSQL extension for scalable vector search with DiskANN algorithm. Complements pgvector with significantly faster search and higher recall at large scale. PostgreSQL licensed |
| Phi-4 (Microsoft) | Open Foundation ModelsLarge Language Models (Base + Chat) | — | — | MODEL | Small but highly capable models optimized for reasoning, edge devices, and on-device inference. Includes Phi-4-reasoning variants with thinking capabilities |
| Phoenix (Arize) | MLOps / LLMOps & ProductionMonitoring, Evaluation & Observability | — | — | TOOL | AI observability & evaluation platform |
| Pi (badlogic) | Agentic AI & Multi-Agent SystemsAutonomous Coding Agents | — | — | AGENT | Terminal coding agent with hash-anchored edits, LSP integration, subagents, MCP support, and package ecosystem |
| PinchBench | Evaluation, Benchmarks & DatasetsBenchmark Suites | — | MIT | BENCHMARK | Benchmarking system for evaluating LLMs as OpenClaw coding agents. Built with Rust by the kilo.ai team. MIT licensed |
| PINTO Model Zoo | MLOps / LLMOps & ProductionModel Hubs & Registries | — | MIT | TOOL | Repository for storing models inter-converted between various frameworks. Supports TensorFlow, PyTorch, ONNX, OpenVINO, TFJS, TF-TRT, TensorFlow Lite (Float32/16/INT8), EdgeTPU, and CoreML. 4,100+ stars with extensive model conversion tools for edge deployment. MIT licensed |
| Pipecat | User Interfaces & Self-hosted PlatformsAgent & Voice Infrastructure | — | BSD-2-Clause | TOOL | Open-source framework for voice and multimodal conversational AI. Build real-time voice agents with support for speech-to-text, LLMs, text-to-speech, and live video. BSD-2-Clause licensed |
| Plane | User Interfaces & Self-hosted PlatformsFull Self-hosted AI Platforms | — | AGPL-3.0 | TOOL | Open-source Jira, Linear, Monday, and ClickUp alternative. AI-powered project management platform with intelligent task triage, sprint planning, and automated workflows. AGPL-3.0 licensed |
| PocketFlow | Agentic AI & Multi-Agent SystemsSingle-Agent Frameworks | — | — | AGENT | 100-line minimalist LLM framework for building agent workflows. Lightweight, extensible architecture for tool use and autonomous task execution |
| PocketPal AI | User Interfaces & Self-hosted PlatformsDesktop & Mobile AI Apps | — | MIT | TOOL | Open-source app that brings small language models directly to your phone. Run AI 100% privately on iOS and Android with no cloud required. MIT licensed |
| Point Cloud Library (PCL) | Specialized Domains3D Vision & Point Cloud Processing | — | BSD | TOOL | Standalone, large-scale open project for 2D/3D image and point cloud processing. Comprehensive algorithms for filtering, feature estimation, surface reconstruction, registration, model fitting, and segmentation. BSD licensed |
| Polars | Core Frameworks & LibrariesData Processing & Manipulation | — | — | TOOL | Blazing-fast DataFrame library (Rust backend) - modern alternative to Pandas for large-scale workloads |
| Polyaxon | MLOps / LLMOps & ProductionDeployment & Orchestration | — | Apache-2.0 | TOOL | MLOps tools for managing and orchestrating the machine learning lifecycle. Reproducible and scalable machine learning workflows on Kubernetes with experiment tracking, model management, and pipeline orchestration. Apache 2.0 licensed |
| Portkey Gateway | MLOps / LLMOps & ProductionMonitoring, Evaluation & Observability | — | MIT | TOOL | Blazing fast AI Gateway to route 200+ LLMs with unified API. Integrated guardrails, load balancing, fallbacks, and cost tracking. MIT licensed |
| PowerInfer | Inference Engines & ServingAdditional Inference Engines | — | MIT | TOOL | High-speed LLM inference for local deployment on consumer GPUs. Achieves up to 11x speedup over llama.cpp on RTX 4090 by exploiting power-law neuron activation patterns. MIT licensed |
| PowerPaint (OpenMMLab) | Generative Media ToolsImage Generation & Editing | — | — | TOOL | Versatile image inpainting model supporting text-guided inpainting, object removal, and outpainting (ECCV 2024) |
| PR-Agent (Qodo) | Developer Tools & IntegrationsCLI Tools & API Clients | — | AGPL-3.0 | TOOL | AI-powered code review agent for GitHub, GitLab, Bitbucket, and Azure DevOps. Automated PR analysis, improvement suggestions, and multi-platform deployment via CLI, GitHub Actions, or webhooks. AGPL-3.0 licensed |
| Practical RL (Yandex Data School) | Resources & LearningCourses & Interactive Playgrounds | — | Unlicense | TOOL | Comprehensive reinforcement learning course covering RL fundamentals, deep RL, policy gradients, actor-critic methods, and practical applications in the wild. Released under the Unlicense |
| PraisonAI | Agentic AI & Multi-Agent SystemsDomain-Specific Agents | — | — | AGENT | 24/7 AI employee team for automating complex challenges. Low-code multi-agent framework with handoffs, guardrails, memory, RAG, and 100+ LLM providers |
| Prefect | MLOps / LLMOps & ProductionDeployment & Orchestration | — | Apache-2.0 | TOOL | Workflow orchestration framework for building resilient data and ML pipelines. Python-native with modern observability and 200+ integrations. Apache 2.0 licensed |
| PRIME-RL | Training & Fine-tuning EcosystemFull Training Frameworks | — | Apache-2.0 | TOOL | Agentic RL Training at Scale from Prime Intellect. Framework for large-scale reinforcement learning capable of scaling to 1000+ GPUs with fully asynchronous RL, FSDP2 training, and vLLM inference. Apache 2.0 licensed |
| PrivateGPT | Retrieval-Augmented Generation (RAG) & KnowledgeRAG Frameworks & Advanced Retrieval Tools | — | — | TOOL | Private document Q&A project for local and offline RAG workflows where data stays inside the user's environment |
| Prompt Engineering Guide (DAIR-AI) | Resources & LearningEducational Resources & Courses | — | MIT | TOOL | Comprehensive guides, papers, lessons, and notebooks for prompt engineering, context engineering, RAG, and AI Agents. The definitive open-source resource for learning prompt engineering with 3M+ learners. MIT licensed |
| Prompt Optimizer | Agentic AI & Multi-Agent SystemsPrompt Engineering & Structured Outputs | — | AGPL-3.0 | AGENT | AI prompt optimization tool with multi-round iterative improvements, dual-mode optimization for system and user prompts, and multi-model support. Available as web app, desktop app, Chrome extension, and Docker deployment. AGPL-3.0 licensed |
| Promptfoo | MLOps / LLMOps & ProductionGuardrails & Safety Tools | — | MIT | TOOL | Open-source LLM evaluation and red teaming framework. Test prompts, agents, and RAGs with automated security vulnerability scanning, side-by-side model comparison, and CI/CD integration. Now part of OpenAI. MIT licensed |
| Promptify | Agentic AI & Multi-Agent SystemsPrompt Engineering & Structured Outputs | — | Apache-2.0 | AGENT | Task-based NLP engine with Pydantic structured outputs, built-in evaluation, and LiteLLM as the universal LLM backend. Think "scikit-learn for LLM-powered NLP". Apache 2.0 licensed |
| PromptTools | Agentic AI & Multi-Agent SystemsPrompt Engineering & Structured Outputs | — | Apache-2.0 | AGENT | Open-source tools for prompt testing and experimentation with support for LLMs and vector databases. Test prompt variants across multiple providers (OpenAI, LLaMA) and vector stores (Chroma, Weaviate, LanceDB). Apache 2.0 licensed |
| Protenix | Specialized DomainsScientific AI & Drug Discovery | — | Apache-2.0 | TOOL | High-accuracy open-source biomolecular structure prediction model from ByteDance. First fully open-source model to outperform AlphaFold3 across diverse benchmarks. Apache 2.0 licensed for both academic and commercial use |
| ProxyAI | Developer Tools & IntegrationsIDE Plugins & Extensions | — | Apache-2.0 | TOOL | Leading open-source AI copilot for JetBrains IDEs. Connect to any model in any environment with auto-apply, image chat, file references, web search, and customizable personas. Apache 2.0 licensed |
| PurpleLlama (Meta) | MLOps / LLMOps & ProductionGuardrails & Safety Tools | — | BSD-3-Clause | TOOL | Comprehensive set of tools to assess and improve LLM security. Includes Llama Guard safety classifiers, CyberSec Eval benchmarks, and Prompt Guard for prompt injection detection. BSD-3-Clause licensed |
| PydanticAI | Agentic AI & Multi-Agent SystemsSingle-Agent Frameworks | — | MIT | AGENT | Type-safe AI agent framework from the creators of Pydantic. Model-agnostic with 20+ providers, built-in observability via Logfire, MCP/A2A protocol support, and YAML/JSON agent definitions. MIT licensed |
| PyMC | Specialized DomainsProbabilistic Programming & Bayesian ML | — | Apache-2.0 | TOOL | Modern, comprehensive probabilistic programming framework in Python. Bayesian modeling with advanced MCMC sampling, variational inference, and seamless integration with ArviZ for visualization. Apache 2.0 licensed |
| PyRIT (Microsoft) | AI Safety, Alignment & InterpretabilityAdversarial & Red-teaming Tools | — | MIT | TOOL | Python Risk Identification Tool for generative AI. Microsoft's open-source framework for automated red teaming with multi-modal attack support and crescendo strategies, informed by experience from 100+ red teaming operations. MIT licensed |
| Pythia (EleutherAI) | Open Foundation ModelsLarge Language Models (Base + Chat) | — | Apache-2.0 | MODEL | Suite of interpretability-focused LLMs (70M to 12B parameters) with fully open training data, intermediate checkpoints, and analysis tools. Designed for studying learning dynamics, and trained on the publicly available Pile dataset. Apache 2.0 licensed |
| PyTorch | Core Frameworks & LibrariesDeep Learning Frameworks | — | — | TOOL | Dynamic computation graphs, Pythonic API, dominant in research and production. The current standard for most frontier AI work |
| PyTorch Forecasting | Core Frameworks & LibrariesClassical ML & Gradient Boosting | — | MIT | TOOL | Time series forecasting with PyTorch. Multiple neural architectures (N-BEATS, TFT, DeepAR) with in-built interpretation capabilities, built on PyTorch Lightning for distributed training. MIT licensed |
| PyTorch Geometric | Core Frameworks & LibrariesDeep Learning Frameworks | — | — | TOOL | Library for deep learning on irregular input data such as graphs, point clouds, and manifolds. Part of the PyTorch ecosystem |
| PyTorch Ignite | Core Frameworks & LibrariesModel Training & Optimization Utilities | — | BSD-3-Clause | TOOL | High-level library for training and evaluating neural networks in PyTorch with an engine, events & handlers system for maximum flexibility. BSD-3-Clause licensed |
| PyTorch Lightning | Core Frameworks & LibrariesModel Training & Optimization Utilities | — | — | TOOL | High-level wrapper for PyTorch that removes boilerplate and adds best practices |
| PyTorch3D | Specialized Domains3D Vision & Point Cloud Processing | — | BSD | TOOL | FAIR's library of reusable components for deep learning with 3D data. Provides efficient 3D operators, differentiable rendering, and mesh processing tools integrated with PyTorch. BSD licensed |
| Qdrant | Retrieval-Augmented Generation (RAG) & KnowledgeVector Databases & Search Engines | — | — | TOOL | High-performance vector search engine in Rust |
| Qlib | Specialized DomainsFinance & Quantitative AI | — | MIT | TOOL | AI-oriented quantitative investment platform from Microsoft. Supports diverse ML modeling paradigms including supervised learning, market dynamics modeling, and RL. Now equipped with RD-Agent for automated R&D process. MIT licensed |
| Quarto | Developer Tools & IntegrationsNotebooks & Interactive Computing | — | MIT | TOOL | Open-source scientific and technical publishing system built on Pandoc. Create dynamic content with Python, R, Julia, and Observable. MIT licensed |
| Quickwit | Retrieval-Augmented Generation (RAG) & KnowledgeVector Databases & Search Engines | — | — | TOOL | Cloud-native search engine for observability. Open-source alternative to Datadog, Elasticsearch, Loki, and Tempo with native vector search support |
| Qwen Code | Developer Tools & IntegrationsAI Coding Assistants (open-source) | — | Apache-2.0 | TOOL | Open-source AI agent for the terminal, optimized for Qwen series models. Multi-protocol provider support including OpenAI, Anthropic, Gemini, Alibaba Cloud, OpenRouter. Features agentic workflow with Skills and SubAgents. Apache 2.0 licensed |
| Qwen-Agent | Agentic AI & Multi-Agent SystemsSingle-Agent Frameworks | — | Apache-2.0 | AGENT | Agent framework built on Qwen models featuring function calling, MCP support, code interpreter, RAG, and Chrome extension. Powers Qwen Chat with advanced tool use and planning capabilities. Apache 2.0 licensed |
| Qwen-Image (Alibaba) | Generative Media ToolsImage Generation & Editing | — | Apache-2.0 | TOOL | 20B MMDiT image foundation model with state-of-the-art complex text rendering and precise image editing. Strong performance in Chinese text generation. Apache 2.0 licensed |
| Qwen3 (Alibaba) | Open Foundation ModelsLarge Language Models (Base + Chat) | — | Apache-2.0 | MODEL | Flagship dense and MoE models (32B/235B) with hybrid thinking modes, 128K context, and strong agentic capabilities. Apache 2.0 licensed |
| Qwen3-Coder-Next (Alibaba) | Open Foundation ModelsCoding & Reasoning Models | — | — | MODEL | Leading open coding model, positioned on the cost-performance Pareto frontier for agent deployment |
| Qwen3-TTS (Alibaba) | Open Foundation ModelsSpeech & Audio Models (TTS, STT, Music) | — | Apache-2.0 | MODEL | Open TTS series supporting stable, expressive, and streaming speech generation with free-form voice design and vivid voice cloning. Natural language instruction-driven control over timbre, emotion, and prosody. Apache 2.0 licensed |
| Qwen3-VL (Alibaba) | Open Foundation ModelsMultimodal Models (Vision + Language) | — | — | MODEL | Latest flagship VLM with native 256K context (expandable to 1M), visual agent capabilities, 3D grounding, and superior multimodal reasoning. Major leap over Qwen2.5-VL |
| Qwen3.6 (Alibaba) | Open Foundation ModelsLarge Language Models (Base + Chat) | — | — | MODEL | Latest flagship series released April 2026 with 1M context window, agentic coding performance competitive with Claude 4.5 Opus, and enhanced multimodal capabilities |
| r/LocalLLaMA | Resources & LearningEducational Resources & Courses | — | — | TOOL | Go-to subreddit for local/open-source LLM topics |
| RAG Web UI | User Interfaces & Self-hosted PlatformsFull Self-hosted AI Platforms | — | Apache-2.0 | TOOL | Intelligent dialogue system based on RAG technology. Build intelligent Q&A systems on your own knowledge base with modern web interface. Apache 2.0 licensed |
| RAG-Anything | Retrieval-Augmented Generation (RAG) & KnowledgeRAG Frameworks & Advanced Retrieval Tools | — | — | TOOL | All-in-One Multimodal RAG system for seamless processing of text, images, tables, and equations. Built on LightRAG |
| RAGAs | Evaluation, Benchmarks & DatasetsEvaluation Frameworks | — | — | TOOL | End-to-end RAG evaluation framework |
| RAGFlow | Retrieval-Augmented Generation (RAG) & KnowledgeRAG Frameworks & Advanced Retrieval Tools | — | — | TOOL | Deep-document-understanding RAG engine |
| RAGLite (Superlinear) | Retrieval-Augmented Generation (RAG) & KnowledgeRAG Frameworks & Advanced Retrieval Tools | — | MPL-2.0 | TOOL | Python toolkit for RAG with DuckDB or PostgreSQL. Lightweight, efficient retrieval-augmented generation without heavy dependencies. MPL 2.0 licensed |
| Ralph | Developer Tools & IntegrationsAI-Native IDEs & Development Environments | — | MIT | TOOL | Autonomous AI development loop for Claude Code with intelligent exit detection. Automates iterative coding workflows with self-monitoring capabilities. MIT licensed |
| RamaLama | Inference Engines & ServingLocal / On-device Inference | — | — | TOOL | Container-centric tool for simplifying local AI model serving. Automatically detects GPUs, pulls optimized container images, and runs models securely in rootless containers with enterprise-grade isolation |
| Ray Train | Training & Fine-tuning EcosystemDistributed Training | — | — | TOOL | Scalable distributed training |
| Reader (Jina AI) | Retrieval-Augmented Generation (RAG) & KnowledgeRAG Frameworks & Advanced Retrieval Tools | — | Apache-2.0 | TOOL | Convert any URL to LLM-friendly input with a simple prefix (r.jina.ai). Free service that extracts article content, removes clutter, and returns clean Markdown for RAG and agentic workflows. Apache 2.0 licensed |
| Real-Time Voice Cloning | Generative Media ToolsAudio / Music / Voice Generation | — | MIT | TOOL | Clone a voice in 5 seconds to generate arbitrary speech in real-time. SV2TTS implementation with speaker encoder and vocoder for instant voice synthesis. MIT licensed |
| RedAmon | AI Safety, Alignment & InterpretabilityAdversarial & Red-teaming Tools | — | MIT | TOOL | AI-powered agentic red team framework that automates offensive security operations from reconnaissance to exploitation to post-exploitation with zero human intervention. Integrates multiple security tools for comprehensive penetration testing. MIT licensed |
| RediSearch | Retrieval-Augmented Generation (RAG) & KnowledgeVector Databases & Search Engines | — | — | TOOL | Full-text, secondary indexing, and vector similarity search for Redis deployments. Useful when retrieval needs low-latency Redis-native search |
| Refact | Developer Tools & IntegrationsAI Coding Assistants (open-source) | — | BSD-3-Clause | TOOL | Open-source AI code assistant with autocomplete, chat, and refactoring. Self-hostable with support for multiple LLM providers. BSD-3-Clause licensed |
| Repomix | Developer Tools & IntegrationsCLI Tools & API Clients | — | MIT | TOOL | Powerful tool that packs your entire repository into a single AI-friendly file. Perfect for feeding codebases to LLMs with smart filtering and token counting. MIT licensed |
| rerankers (Answer.AI) | Retrieval-Augmented Generation (RAG) & KnowledgeRAG Frameworks & Advanced Retrieval Tools | — | Apache-2.0 | TOOL | Lightweight unified API for all common reranking and cross-encoder models. Supports RankGPT, ColBERT, FlashRank, and API-based rerankers with a dependency-free core. Apache 2.0 licensed |
| Responsible AI Toolbox | AI Safety, Alignment & InterpretabilityResponsible AI Development | — | MIT | TOOL | Suite of tools providing model and data exploration, assessment interfaces and libraries for understanding AI systems. Enables developers to develop and monitor AI more responsibly with better data-driven actions. MIT licensed |
| Rig | Retrieval-Augmented Generation (RAG) & KnowledgeLLM Application Frameworks | — | MIT | TOOL | Rust library for building scalable, modular LLM-powered applications. Type-safe agent framework with unified LLM interface, built-in vector store integrations, and ergonomic abstractions for production AI systems. MIT licensed |
| RL Baselines3 Zoo | Specialized DomainsGame AI & Simulations | — | MIT | TOOL | A training framework for Stable Baselines3 reinforcement learning agents with hyperparameter optimization, pre-trained agents, and extensive benchmark environments. MIT licensed |
| RLinf | Training & Fine-tuning EcosystemDistributed Training | — | Apache-2.0 | TOOL | Scalable open-source RL infrastructure for post-training foundation models via reinforcement learning. Features M2Flow paradigm for embodied AI and agentic workflows with real-world robotics integrations. Apache 2.0 licensed |
| rLLM | Training & Fine-tuning EcosystemFull Training Frameworks | — | Apache-2.0 | TOOL | Democratizing Reinforcement Learning for LLMs. Framework for training AI agents with RL featuring near-zero code changes, CLI-first workflow, and 50+ built-in benchmarks. Supports GRPO, REINFORCE, RLOO with verl and tinker backends. Apache 2.0 licensed |
| Roo Code | Developer Tools & IntegrationsAI Coding Assistants (open-source) | — | — | TOOL | Open-source editor-based coding agent with multiple modes and tool integrations |
| RTAB-Map | Specialized Domains3D Vision & Point Cloud Processing | — | BSD | TOOL | Real-Time Appearance-Based Mapping library for RGB-D, Stereo and LiDAR SLAM. Graph-based SLAM approach with incremental appearance-based loop closure detection for large-scale and long-term operation. BSD licensed |
| RTP-LLM (Alibaba) | Inference Engines & ServingHigh-performance Serving & API Servers | — | Apache-2.0 | TOOL | Alibaba's high-performance LLM inference acceleration engine. Powers production LLM services across Taobao, Tmall, and Alibaba's international AI platform. Supports PagedAttention, FlashAttention, FlashDecoding, INT8/INT4 quantization, and heterogeneous hardware (GPU/ARM CPU/Intel). Apache 2.0 licensed |
| ruby_llm | Retrieval-Augmented Generation (RAG) & KnowledgeLLM Application Frameworks | — | MIT | TOOL | One beautiful Ruby API for OpenAI, Anthropic, Gemini, Bedrock, Azure, OpenRouter, DeepSeek, Ollama, and 15+ providers. Agents, Chat, Vision, Audio, PDF, Images, Embeddings, Tools, Streaming and Rails integration. MIT licensed |
| Ruler | Developer Tools & IntegrationsCLI Tools & API Clients | — | MIT | TOOL | Central AI agent rule registry. Manages and distributes rules for AI coding agents across projects. MIT licensed |
| RWKV-7 "Goose" (BlinkDL) | Open Foundation ModelsLarge Language Models (Base + Chat) | — | — | MODEL | Novel RNN architecture with transformer-level LLM performance. 100% attention-free, linear-time, constant-space (no kv-cache), infinite ctx_len. Linux Foundation AI project with runtime already deployed in Windows & Office |
| SAELens | AI Safety, Alignment & InterpretabilityInterpretability & Explainability | — | — | TOOL | Sparse autoencoders for interpretable features |
| Safe-RLHF | AI Safety, Alignment & InterpretabilityAlignment & RLHF Tools | — | — | TOOL | Safe reinforcement learning from human feedback |
| safetensors | Core Frameworks & LibrariesModel Training & Optimization Utilities | — | — | TOOL | Simple, safe way to store and distribute tensors. Fast, secure alternative to pickle for model serialization |
| SAM 2 | Specialized DomainsComputer Vision | — | — | TOOL | Promptable image and video segmentation model with released checkpoints and training code |
| scikit-learn | Core Frameworks & LibrariesClassical ML & Gradient Boosting | — | — | TOOL | Industry-standard library for traditional machine learning (classification, regression, clustering, pipelines) |
| SciPy | Core Frameworks & LibrariesData Processing & Manipulation | — | — | TOOL | Scientific computing algorithms (optimization, linear algebra, statistics, signal processing) |
| Scrapy | Core Frameworks & LibrariesData Processing & Manipulation | — | BSD-3-Clause | TOOL | Fast, high-level web crawling and scraping framework for Python. Extract structured data from websites at scale with built-in support for handling common challenges like pagination, cookies, and concurrent requests. BSD-3-Clause licensed |
| SD.Next | Generative Media ToolsImage Generation & Editing | — | — | TOOL | All-in-one WebUI for AI generative image and video creation with multi-platform support, SDNQ quantization, and balanced CPU/GPU memory offload |
| SDG (Harbin Institute) | Training & Fine-tuning EcosystemSynthetic Data Generation | — | Apache-2.0 | TOOL | Specialized framework for generating high-quality structured tabular synthetic data with CTGAN models supporting billion-level data processing. Apache 2.0 licensed |
| SDV (Synthetic Data Vault) | Training & Fine-tuning EcosystemSynthetic Data Generation | — | — | TOOL | High-fidelity tabular and relational synthetic data |
| Search-R1 | Training & Fine-tuning EcosystemFull Training Frameworks | — | Apache-2.0 | TOOL | RL training framework for reasoning and search engine calling. Enables LLMs to interleave reasoning with real-time web search for enhanced knowledge retrieval. Built on veRL with efficient distributed training. Apache 2.0 licensed |
| Seldon Core | MLOps / LLMOps & ProductionDeployment & Orchestration | — | — | TOOL | MLOps and LLMOps framework for deploying, managing and scaling AI systems in Kubernetes. Standardized deployment across model types with autoscaling, multi-model serving, and A/B experiments |
| Self-hosted AI Starter Kit (n8n) | User Interfaces & Self-hosted PlatformsFull Self-hosted AI Platforms | — | Apache-2.0 | TOOL | Open-source Docker Compose template to quickly set up a local AI environment. Curated by n8n, combines self-hosted n8n with Ollama, Qdrant, and PostgreSQL for secure, self-hosted AI workflows. Apache 2.0 licensed |
| Semantic Kernel | Agentic AI & Multi-Agent SystemsSingle-Agent Frameworks | — | — | AGENT | SDK for building and orchestrating AI agents and workflows across multiple programming languages |
| Semantic Router | Retrieval-Augmented Generation (RAG) & KnowledgeRAG Frameworks & Advanced Retrieval Tools | — | MIT | TOOL | Superfast AI decision-making layer for LLMs and agents. Uses semantic vector space to route requests using semantic meaning rather than waiting for slow LLM generations. Cuts routing time from seconds to milliseconds. MIT licensed |
| sentence-transformers | Core Frameworks & LibrariesNLP & Transformers | — | — | TOOL | Classic library for sentence and image embeddings |
| Serena | Developer Tools & IntegrationsIDE Plugins & Extensions | — | MIT | TOOL | Powerful MCP toolkit for coding agents providing semantic retrieval and editing capabilities. Integrates language servers for IDE-level code understanding. MIT licensed |
| SGLang | Inference Engines & ServingHigh-performance Serving & API Servers | — | — | TOOL | Next-gen serving framework with RadixAttention. Powers xAI's production workloads at 100K+ GPU scale |
| SHAP | Core Frameworks & LibrariesModel Training & Optimization Utilities | — | — | TOOL | Game theoretic approach to explain the output of any machine learning model. Industry standard for model interpretability |
| Shapash | AI Safety, Alignment & InterpretabilityInterpretability & Explainability | — | Apache-2.0 | TOOL | User-friendly explainability library for transparent ML models. Beautiful visualizations with explicit labels that everyone can understand. Generates web reports and integrates with SHAP/LIME. Apache 2.0 licensed |
| shimmy | Inference Engines & ServingAdditional Inference Engines | — | Apache-2.0 | TOOL | Python-free Rust inference server with OpenAI API compatibility. Supports GGUF and SafeTensors formats with hot model swap, auto-discovery, and single binary deployment for zero-dependency inference. Apache 2.0 licensed |
| Show-o | Open Foundation ModelsMultimodal Models (Vision + Language) | — | Apache-2.0 | MODEL | Unified multimodal model for both multimodal understanding and text-to-image generation with transformative autoregressive modeling. Apache 2.0 licensed |
| SillyTavern | User Interfaces & Self-hosted PlatformsDesktop & Mobile AI Apps | — | — | TOOL | Highly customizable role-playing frontend |
| Sim Studio | Agentic AI & Multi-Agent SystemsMulti-Agent Orchestration | — | Apache-2.0 | AGENT | Open-source AI workspace for building, deploying, and orchestrating AI agents. Visual canvas with 1000+ integrations, multi-framework support (Agno, OpenAI, LangChain, Google ADK), and self-hosted or cloud deployment. Apache 2.0 licensed |
| SimpleEvals (OpenAI) | Evaluation, Benchmarks & DatasetsEvaluation Frameworks | — | MIT | TOOL | Lightweight library for evaluating language models with transparent accuracy numbers. Reference implementations for MMLU, GPQA, MATH, HumanEval, MGSM, DROP, and SimpleQA benchmarks. MIT licensed |
| simpleRL-reason | Training & Fine-tuning EcosystemFull Training Frameworks | — | MIT | TOOL | Simple reinforcement learning recipe to improve models' reasoning abilities. Rule-based reward with GSM8K/Math datasets, extending from OpenRLHF. MIT licensed |
| skorch | Core Frameworks & LibrariesModel Training & Optimization Utilities | — | — | TOOL | scikit-learn compatible neural network library that wraps PyTorch. Seamlessly integrate PyTorch models with scikit-learn pipelines, grid search, and cross-validation |
| skrl | Specialized DomainsGame AI & Simulations | — | MIT | TOOL | Modular reinforcement learning library implemented in PyTorch, JAX, and NVIDIA Warp with support for Gymnasium, NVIDIA Isaac Lab, MuJoCo Playground, and other environments. MIT licensed |
| skrub | Core Frameworks & LibrariesData Processing & Manipulation | — | BSD-3-Clause | TOOL | Machine learning with dataframes for dirty categorical data. Preprocessing and feature engineering for heterogeneous data with seamless Pandas/Polars integration. BSD-3-Clause licensed |
| sktime | Core Frameworks & LibrariesClassical ML & Gradient Boosting | — | — | TOOL | Unified framework for machine learning with time series. scikit-learn compatible API for forecasting, classification, clustering, and anomaly detection |
| SkyPilot | MLOps / LLMOps & ProductionDeployment & Orchestration | — | Apache-2.0 | TOOL | Run, manage, and scale AI workloads on any AI infrastructure. Unified interface to access and manage compute across Kubernetes, Slurm, and 20+ cloud providers. Used by Shopify and research institutions for training and inference. Apache 2.0 licensed |
| SkyReels V2/V3 (Skywork) | Generative Media ToolsVideo Generation | — | — | TOOL | First open-source infinite-length film generative model using AutoRegressive Diffusion-Forcing |
| Skywork-R1V (Skywork AI) | Open Foundation ModelsMultimodal Models (Vision + Language) | — | MIT | MODEL | Advanced multimodal reasoning model specializing in vision-language tasks with chain-of-thought capabilities. State-of-the-art open multimodal reasoning with 76.0 on MMMU benchmark. MIT licensed |
| slime | Training & Fine-tuning EcosystemFull Training Frameworks | — | Apache-2.0 | TOOL | LLM post-training framework for RL scaling from THUDM. Supports SFT and RL training with multi-turn compilation feedback, powering projects like TritonForge for automated GPU kernel generation. Apache 2.0 licensed |
| Smart2Brain | User Interfaces & Self-hosted PlatformsLocal AI Chat UIs & Personal Assistants | — | MIT | TOOL | Privacy-focused Obsidian plugin for AI-powered second brain functionality. Chat with your notes using local or remote LLMs including Ollama and OpenAI. MIT licensed |
| smolagents | Agentic AI & Multi-Agent SystemsSingle-Agent Frameworks | — | — | AGENT | Lightweight agent framework centered on tool use and code-executing workflows |
| Snorkel | Core Frameworks & LibrariesData Processing & Manipulation | — | Apache-2.0 | TOOL | System for quickly generating training data with weak supervision. Programmatically label, build, and manage training data using labeling functions and probabilistic consensus models. Powers Snorkel Flow and used by Google, Apple, and Intel. Apache 2.0 licensed |
| Soda Core | Core Frameworks & LibrariesData Quality & Validation | — | Apache-2.0 | TOOL | Data contracts engine for the modern data stack. Define data quality checks in YAML and automatically validate schema and data across your pipelines. Supports 20+ data sources including Snowflake, BigQuery, and PostgreSQL. Apache 2.0 licensed |
| spaCy (Explosion AI) | Core Frameworks & LibrariesNLP & Transformers | — | — | TOOL | Industrial-strength natural language processing covering 75+ languages, with transformer pipelines and production-grade NER, parsing, and text classification |
| SpeechBrain | Open Foundation ModelsSpeech & Audio Models (TTS, STT, Music) | — | Apache-2.0 | MODEL | PyTorch-based speech toolkit for ASR, TTS, speaker recognition, and speech enhancement. Modular, extensible framework with state-of-the-art recipes. Apache 2.0 licensed |
| Spring AI | Retrieval-Augmented Generation (RAG) & KnowledgeLLM Application Frameworks | — | Apache-2.0 | TOOL | Application framework for AI engineering in the Spring ecosystem. Unified API for LLMs, vector stores, and embedding models with seamless integration into Spring Boot applications. Supports RAG, tool calling, and structured outputs. Apache 2.0 licensed |
| SPTAG (Microsoft) | Retrieval-Augmented Generation (RAG) & KnowledgeVector Databases & Search Engines | — | MIT | TOOL | Distributed approximate nearest neighbor search library with high-quality vector index build and online serving toolkits. Powers Bing's vector search at trillion-vector scale. MIT licensed |
| sqlite-vec | Retrieval-Augmented Generation (RAG) & KnowledgeVector Databases & Search Engines | — | MIT OR Apache-2.0 | TOOL | A vector search SQLite extension that runs anywhere. Extremely small, "fast enough" vector search written in pure C with no dependencies. Perfect for embedded and edge deployments. MIT/Apache-2.0 dual licensed |
| SQLMesh | Core Frameworks & LibrariesData Transformation & Analytics Engineering | — | Apache-2.0 | TOOL | Scalable and efficient data transformation framework with dbt compatibility. Features automatic data lineage, time travel, and virtual data environments for testing. Optimized for large-scale data warehouses. Apache 2.0 licensed |
| Stable Audio Tools | Generative Media ToolsAudio / Music / Voice Generation | — | MIT | TOOL | Stability AI's open-source audio and music generative models. Latent diffusion model for generating audio conditioned on metadata and timing, providing faster inference times and creative control for sound effects and music production. MIT licensed |
| Stable Diffusion WebUI Forge - Neo | Generative Media ToolsImage Generation & Editing | — | — | TOOL | Actively maintained Forge-based Stable Diffusion web UI with the familiar extension-driven workflow |
| Stable Diffusion XL | Open Foundation ModelsImage Generation Models | — | CreativeML Open RAIL++-M | MODEL | Next-generation image generation model with significantly improved quality, 1024px native resolution, and better prompt adherence. Foundation for SDXL-based video models. CreativeML Open RAIL++-M licensed |
| Stable-Baselines3 | Specialized DomainsReinforcement Learning & Robotics | — | — | TOOL | Production-ready RL algorithms |
| Stanza | Core Frameworks & LibrariesNLP & Transformers | — | Apache-2.0 | TOOL | Stanford NLP Python library for 100+ human languages. State-of-the-art neural pipelines for tokenization, NER, parsing, and sentiment analysis with pre-trained models. Apache 2.0 licensed |
| Start Machine Learning (louisfb01) | Resources & LearningEducational Resources & Courses | — | MIT | TOOL | A complete guide to start and improve in machine learning and AI in 2026 without any background. Curated learning path with the latest news, state-of-the-art techniques, and comprehensive resources for beginners. MIT licensed |
| StatsForecast | Core Frameworks & LibrariesClassical ML & Gradient Boosting | — | — | TOOL | Lightning-fast statistical forecasting with ARIMA, ETS, CES, and Theta models. Optimized for high-performance time series workloads |
| Steel Browser | Agentic AI & Multi-Agent SystemsDomain-Specific Agents | — | Apache-2.0 | AGENT | Open-source browser API for AI agents and apps. Batteries-included browser sandbox for web automation without infrastructure worries. Apache 2.0 licensed |
| Strands Agents | Agentic AI & Multi-Agent SystemsSingle-Agent Frameworks | — | Apache-2.0 | AGENT | Model-driven approach to building AI agents in just a few lines of code. Multi-agent systems, autonomous agents, and streaming support with built-in MCP. Apache 2.0 licensed |
| Streaming (MosaicML) | Training & Fine-tuning EcosystemDistributed Training | — | Apache-2.0 | TOOL | High-performance data streaming library for efficient neural network training. Streams training data from cloud storage (S3, GCS, Azure) with local caching and deterministic shuffling. Apache 2.0 licensed |
| Streamlit | Core Frameworks & LibrariesInteractive ML Apps & Notebooks | — | — | TOOL | The fastest way to build and share data apps. Transform Python scripts into beautiful web applications with minimal code. Widely used for ML model demos, data visualization, and internal tools |
| Superagent | AI Safety, Alignment & InterpretabilityAdversarial & Red-teaming Tools | — | MIT | TOOL | Protects AI applications against prompt injections, data leaks, and harmful outputs. Embed safety directly into your app and prove compliance to your customers. MIT licensed |
| SurfSense | Retrieval-Augmented Generation (RAG) & KnowledgeRAG Frameworks & Advanced Retrieval Tools | — | — | TOOL | Privacy-focused NotebookLM-style workspace for teams to search, organize, and query knowledge with self-hosted RAG |
| Swarms | Agentic AI & Multi-Agent SystemsMulti-Agent Orchestration | — | — | AGENT | Bleeding-edge enterprise multi-agent orchestration |
| SWE-bench | Evaluation, Benchmarks & DatasetsBenchmark Suites | — | — | BENCHMARK | Evaluates LLMs on real-world GitHub issues from 15+ Python repositories |
| SWE-rebench (Nebius) | Evaluation, Benchmarks & DatasetsBenchmark Suites | — | — | BENCHMARK | Continuously updated benchmark with 21,000+ real-world SWE tasks for evaluating agentic LLMs. Decontaminated, mined from GitHub |
| Sweetviz | Core Frameworks & LibrariesData Processing & Manipulation | — | MIT | TOOL | Beautiful, high-density visualizations for exploratory data analysis in two lines of code. Self-contained HTML reports for dataset comparison and target analysis. MIT licensed |
| Symphony | Agentic AI & Multi-Agent SystemsMulti-Agent Orchestration | — | Apache-2.0 | AGENT | Turns project work into isolated, autonomous implementation runs. Monitors work boards, spawns agents to handle tasks, and provides proof of work including CI status, PR reviews, and walkthrough videos. Engineering preview for managing work instead of supervising coding agents. Apache 2.0 licensed |
| SynapseML | Core Frameworks & LibrariesClassical ML & Gradient Boosting | — | MIT | TOOL | Distributed machine learning on Apache Spark. Scalable, composable APIs for text analytics, vision, anomaly detection with seamless Python/Scala/R/.NET integration. MIT licensed |
| T5 (Google) | Open Foundation ModelsLarge Language Models (Base + Chat) | — | Apache-2.0 | MODEL | Text-to-Text Transfer Transformer that unified NLP tasks under a single encoder-decoder architecture. The foundation for Flan-T5 and many downstream applications. One of the first OSI-validated fully open-source language models with training data and code. Apache 2.0 licensed |
| Tabby | Developer Tools & IntegrationsAI Coding Assistants (open-source) | — | — | TOOL | Self-hosted AI coding assistant |
| TabbyAPI | Inference Engines & ServingHigh-performance Serving & API Servers | — | — | TOOL | FastAPI-based API server for ExLlamaV2/V3 backends. OpenAI-compatible API with support for model loading/unloading, embeddings, speculative decoding, multi-LoRA, and streaming |
| Tantivy | Retrieval-Augmented Generation (RAG) & KnowledgeVector Databases & Search Engines | — | — | TOOL | Full-text search engine library inspired by Apache Lucene and written in Rust. Powers Quickwit and other production search systems |
| Temporal | Core Frameworks & LibrariesData Processing & Manipulation | — | MIT | TOOL | Durable execution platform for reliable workflow orchestration. Build resilient data pipelines and ML workflows that survive failures and continue execution exactly where they left off. MIT licensed |
| TensorFlow | Core Frameworks & LibrariesDeep Learning Frameworks | — | — | TOOL | End-to-end platform with excellent production deployment, TPU support, and large-scale serving tools |
| TensorFlow Model Garden | MLOps / LLMOps & ProductionModel Hubs & Registries | — | Apache-2.0 | TOOL | Official TensorFlow repository of state-of-the-art (SOTA) models and modeling solutions. Contains reference implementations for BERT, ResNet, Transformer, and many more with pre-trained weights and training scripts. Apache 2.0 licensed |
| TensorFlow Tutorials | Resources & LearningStarter Projects & Examples | — | — | TOOL | Official guides for beginners to advanced users |
| TensorRT-LLM | Inference Engines & ServingHigh-performance Serving & API Servers | — | — | TOOL | NVIDIA's official high-performance inference backend |
| TensorZero | MLOps / LLMOps & ProductionMonitoring, Evaluation & Observability | — | — | TOOL | Open-source LLMOps platform unifying LLM gateway, observability, evaluation, and experimentation. Production-grade with sub-1ms latency, used by Fortune 10 companies |
| Text Embeddings Inference (Hugging Face) | Retrieval-Augmented Generation (RAG) & KnowledgeEmbedding Models | — | Apache-2.0 | TOOL | Blazing fast inference solution for text embedding models. High-performance extraction with token-based dynamic batching, Flash Attention, and support for FlagEmbedding, E5, GTE, and more. OpenAI-compatible API with Docker deployment. Apache 2.0 licensed |
| text-generation-webui | User Interfaces & Self-hosted PlatformsLocal AI Chat UIs & Personal Assistants | — | — | TOOL | Web UI for running local LLMs with multiple backends, extensions, and model formats |
| TextAttack | Core Frameworks & LibrariesData Processing & Manipulation | — | MIT | TOOL | Python framework for adversarial attacks, data augmentation, and model training in NLP. Augment datasets to increase model robustness and generate adversarial examples. MIT licensed |
| TFX (TensorFlow Extended) | Core Frameworks & LibrariesData Quality & Validation | — | Apache-2.0 | TOOL | End-to-end platform for deploying production ML pipelines. Data validation, transformation, model training, and serving with TensorFlow. Powers Google's production ML infrastructure. Apache 2.0 licensed |
| The Incredible PyTorch | Resources & LearningCourses & Interactive Playgrounds | — | — | TOOL | Curated list of PyTorch tutorials, papers, projects, and communities for deep learning researchers |
| Tianshou | Specialized DomainsGame AI & Simulations | — | MIT | TOOL | An elegant PyTorch deep reinforcement learning library with clean API design and comprehensive algorithm implementations. Supports both single-agent and multi-agent RL with GPU acceleration. MIT licensed |
| Time Series Library (TSLib) | Specialized DomainsTime Series & Scientific AI | — | — | TOOL | Comprehensive benchmark for time-series models |
| timm (PyTorch Image Models) | Core Frameworks & LibrariesDeep Learning Frameworks | — | Apache-2.0 | TOOL | The largest collection of PyTorch image encoders and backbones. 900+ pretrained models including ResNet, EfficientNet, Vision Transformer, ConvNeXt, and more with training and inference scripts. Apache 2.0 licensed |
| tinygrad | Core Frameworks & LibrariesDeep Learning Frameworks | — | — | TOOL | Minimalist deep learning framework with tiny code footprint. The "you like PyTorch? you like micrograd? you love tinygrad!" philosophy - simple yet powerful |
| TinyZero | Training & Fine-tuning EcosystemFull Training Frameworks | — | Apache-2.0 | TOOL | Minimal reproduction of DeepSeek R1-Zero for countdown and multiplication tasks. Clean, accessible implementation for understanding RL-based reasoning training. Apache 2.0 licensed |
| tokenizers (Hugging Face) | Core Frameworks & LibrariesNLP & Transformers | — | — | TOOL | Fast state-of-the-art tokenizers for training and inference |
| ToolJet | Agentic AI & Multi-Agent SystemsDomain-Specific Agents | — | — | AGENT | Self-hostable internal app builder with AI app and agent workflows for operations teams |
| torchao | Core Frameworks & LibrariesModel Training & Optimization Utilities | — | — | TOOL | PyTorch native quantization and sparsity for training and inference. Drop-in optimizations for production deployment |
| torchaudio | Specialized DomainsComputer Vision | — | BSD-2-Clause | TOOL | PyTorch audio processing library. Comprehensive toolkit for audio I/O, transformations, and deep learning with support for speech recognition, TTS, and audio classification. BSD-2-Clause licensed |
| TorchGeo | Specialized DomainsScientific AI & Physics ML | — | MIT | TOOL | PyTorch domain library for geospatial data. Datasets, samplers, transforms, and pre-trained models for multispectral satellite imagery and remote sensing. First library with pre-trained models for Sentinel-2 bands. MIT licensed |
| torchmetrics | Core Frameworks & LibrariesModel Training & Optimization Utilities | — | — | TOOL | Machine learning metrics for distributed, scalable PyTorch applications. 80+ metrics with built-in distributed synchronization |
| TorchTitan (PyTorch) | Training & Fine-tuning EcosystemFull Training Frameworks | — | BSD-3-Clause | TOOL | PyTorch native platform for training generative AI models at scale. Showcases 4D parallelism (FSDP, tensor, pipeline, context) for LLM pretraining with 65%+ speedups over optimized baselines. BSD-3-Clause licensed |
| torchtune | Training & Fine-tuning EcosystemFull Training Frameworks | — | — | TOOL | PyTorch-native library for post-training, fine-tuning, and experimentation with LLMs |
| TorchVision Models | MLOps / LLMOps & ProductionModel Hubs & Registries | — | BSD-3-Clause | TOOL | PyTorch's official computer vision library with 50+ pre-trained model architectures including ResNet, EfficientNet, Vision Transformers (ViT), ConvNeXt, and more. The de facto standard model zoo for PyTorch computer vision. BSD-3-Clause licensed |
| Tracecat | Agentic AI & Multi-Agent SystemsDomain-Specific Agents | — | — | AGENT | Self-hostable security automation platform for building agentic workflows across alerts, cases, and operations |
| TradingAgents | Agentic AI & Multi-Agent SystemsDomain-Specific Agents | — | — | AGENT | Multi-agent framework for financial trading. Simulates professional trading firm operations with 6+ specialized agent roles, backtesting, risk management, and portfolio optimization. Built with LangGraph, supports multiple LLM providers |
| Trae Agent | Agentic AI & Multi-Agent SystemsAutonomous Coding Agents | — | — | AGENT | Software-engineering agent from ByteDance for autonomous coding tasks and repository-level development workflows |
| TransformerLens | AI Safety, Alignment & InterpretabilityInterpretability & Explainability | — | — | TOOL | Gold-standard for mechanistic interpretability |
| Transformers (Hugging Face) | Core Frameworks & LibrariesNLP & Transformers | — | — | TOOL | The de facto standard library for pretrained NLP models. 1M+ models, 250,000+ downloads/day. BERT, GPT, Llama, Qwen, and hundreds more |
| Transformers Tutorials (Niels Rogge) | Resources & LearningCourses & Interactive Playgrounds | — | — | TOOL | Comprehensive tutorials and demos using the Hugging Face Transformers library for NLP, vision, and multimodal tasks |
| Transformers.js | MLOps / LLMOps & ProductionModel Hubs & Registries | — | Apache-2.0 | TOOL | State-of-the-art Machine Learning for the web. Run Hugging Face Transformers directly in your browser with no server needed. Supports 1000+ models including BERT, GPT-2, T5, and more via ONNX Runtime Web. Apache 2.0 licensed |
| Triton | Core Frameworks & LibrariesDeep Learning Frameworks | — | MIT | TOOL | Language and compiler for writing highly efficient custom deep-learning primitives. Powers kernel optimizations in PyTorch, JAX, and other frameworks. MIT licensed |
| Triton Inference Server | Inference Engines & ServingHigh-performance Serving & API Servers | — | — | TOOL | NVIDIA's production-grade open-source inference serving software. Supports multiple frameworks (TensorRT, PyTorch, ONNX) with optimized cloud and edge deployment |
| TRL (Transformer Reinforcement Learning) | Training & Fine-tuning EcosystemFull Training Frameworks | — | — | TOOL | Official library for RLHF, SFT, DPO, ORPO |
| TruLens | Evaluation, Benchmarks & DatasetsEvaluation Frameworks | — | MIT | TOOL | Evaluation and tracking for LLM experiments and AI agents. Provides feedback functions for measuring quality, relevance, and groundedness with LangChain and LlamaIndex integrations. MIT licensed |
| txtai | Retrieval-Augmented Generation (RAG) & KnowledgeRAG Frameworks & Advanced Retrieval Tools | — | — | TOOL | All-in-one AI framework for semantic search, LLM orchestration and language model workflows. Embeddings database with customizable pipelines |
| Typesense | Retrieval-Augmented Generation (RAG) & KnowledgeVector Databases & Search Engines | — | GPL-3.0 | TOOL | Open source alternative to Algolia + Pinecone. Fast, typo-tolerant, in-memory fuzzy search engine with native vector search capabilities. GPL-3.0 licensed |
| uAgents (Fetch.ai) | Agentic AI & Multi-Agent SystemsDomain-Specific Agents | — | Apache-2.0 | AGENT | Fast and lightweight framework for creating decentralized agents with ease. Agents automatically join the network by registering on the Almanac smart contract. Supports agent-to-agent communication out of the box. Apache 2.0 licensed |
| UI-TARS Desktop (ByteDance) | Agentic AI & Multi-Agent SystemsDomain-Specific Agents | — | Apache-2.0 | AGENT | Open-source multimodal AI agent stack with native GUI agent capabilities. Desktop application bringing GUI agent and vision power to your computer, browser, and terminal. Apache 2.0 licensed |
| Ultralytics YOLO | Specialized DomainsComputer Vision | — | — | TOOL | State-of-the-art real-time object detection |
| UltraRAG (OpenBMB) | Retrieval-Augmented Generation (RAG) & KnowledgeRAG Frameworks & Advanced Retrieval Tools | — | Apache-2.0 | TOOL | First lightweight RAG framework based on Model Context Protocol (MCP) architecture. Low-code RAG pipeline builder with comprehensive evaluation system and DeepResearch capabilities. From Tsinghua THUNLP, NEUIR, OpenBMB, and AI9stars. Apache 2.0 licensed |
| Ultravox (Fixie AI) | Open Foundation ModelsSpeech & Audio Models (TTS, STT, Music) | — | MIT | MODEL | Fast multimodal LLM for real-time voice. Production-grade speech-to-text with streaming audio input and low-latency response for conversational AI applications. MIT licensed |
| Unity ML-Agents | Specialized DomainsGame AI & Simulations | — | Apache-2.0 | TOOL | Toolkit for training intelligent agents in games and simulations using deep reinforcement learning. Enables NPC behavior control, automated testing, and game design evaluation. Apache 2.0 licensed |
| Unsloth | Training & Fine-tuning EcosystemFull Training Frameworks | — | — | TOOL | 2× faster, 70% less memory fine-tuning |
| Unstructured | Retrieval-Augmented Generation (RAG) & KnowledgeRAG Frameworks & Advanced Retrieval Tools | — | — | TOOL | Best-in-class document preprocessing |
| Upscayl | Generative Media ToolsImage Generation & Editing | — | AGPL-3.0 | TOOL | Free and open-source AI image upscaler for Linux, macOS, and Windows. Uses Real-ESRGAN and Vulkan architecture to enhance images by reconstructing high-resolution details. Cross-platform desktop app with batch processing. AGPL-3.0 licensed |
| Upsonic | Agentic AI & Multi-Agent SystemsSingle-Agent Frameworks | — | — | AGENT | Agent framework for fintech and banking with built-in MCP support, guardrails, and tool server architecture |
| USearch | Retrieval-Augmented Generation (RAG) & KnowledgeVector Databases & Search Engines | — | Apache-2.0 | TOOL | Fast single-file similarity search & clustering engine for vectors. Smaller and faster than FAISS with 20+ language bindings (C++, Python, JavaScript, Rust, Java, Go, etc.) and support for custom metrics. Apache 2.0 licensed |
| uv | Core Frameworks & LibrariesData Processing & Manipulation | — | Apache-2.0 | TOOL | An extremely fast Python package and project manager, written in Rust. 10-100x faster than pip with built-in virtual environment management, dependency resolution, and lockfiles. Essential for modern AI/ML development workflows. Apache 2.0 and MIT dual-licensed |
| Vaex | Core Frameworks & LibrariesData Processing & Manipulation | — | MIT | TOOL | Out-of-core hybrid Apache Arrow/NumPy DataFrame for Python. Visualize and explore billion-row datasets at millions of rows per second. MIT licensed |
| Vald | Retrieval-Augmented Generation (RAG) & KnowledgeVector Databases & Search Engines | — | Apache-2.0 | TOOL | Highly scalable distributed vector search engine. Cloud-native architecture with automatic indexing, horizontal scaling, and multiple ANN algorithm support. Apache 2.0 licensed |
| Vearch | Retrieval-Augmented Generation (RAG) & KnowledgeVector Databases & Search Engines | — | Apache-2.0 | TOOL | Cloud-native distributed vector database for AI-native applications. Efficient similarity search of embedding vectors with horizontal scaling and real-time indexing. Apache 2.0 licensed |
| Vectara Hallucination Leaderboard | Evaluation, Benchmarks & DatasetsBenchmark Suites | — | Apache-2.0 | BENCHMARK | Leaderboard comparing how often LLMs hallucinate when summarizing short documents. Systematic evaluation of factual consistency across major models. Apache 2.0 licensed |
| Vector | Core Frameworks & LibrariesData Processing & Manipulation | — | MPL-2.0 | TOOL | A high-performance observability data pipeline for collecting, transforming, and routing logs and metrics. Real-time data processing with 50+ sources and sinks including Kafka, S3, and Elasticsearch. Ideal for AI/ML log processing and data ingestion. MPL 2.0 licensed |
| VectorChord | Retrieval-Augmented Generation (RAG) & KnowledgeVector Databases & Search Engines | — | AGPL-3.0 | TOOL | Scalable, fast, and disk-friendly vector search in Postgres. Successor to pgvecto.rs with production-grade performance and efficient storage. AGPL-3.0 licensed |
| VectorDBBench (Zilliz) | Retrieval-Augmented Generation (RAG) & KnowledgeVector Databases & Search Engines | — | MIT | TOOL | Industry-standard benchmark suite for vector databases. Test and compare performance of Milvus, Zilliz Cloud, and other vector DBs with your own datasets. MIT licensed |
| VeOmni (ByteDance) | Training & Fine-tuning EcosystemFull Training Frameworks | — | Apache-2.0 | TOOL | Versatile framework for both single- and multi-modal pre-training and post-training. Model-centric distributed recipe zoo supporting text, vision, audio, and video models with unified training interface. Apache 2.0 licensed |
| Verba | Retrieval-Augmented Generation (RAG) & KnowledgeRAG Frameworks & Advanced Retrieval Tools | — | BSD-3-Clause | TOOL | The Golden RAGtriever - an end-to-end, streamlined, user-friendly RAG interface powered by Weaviate. Chat with your documents using hybrid search, multiple chunking strategies, and support for various LLM providers. BSD-3-Clause licensed |
| Vercel AI SDK | Developer Tools & IntegrationsSDKs & API Development Tools | — | Apache-2.0 | TOOL | Provider-agnostic TypeScript toolkit for building AI-powered applications and agents. Unified API for OpenAI, Anthropic, Google, and 20+ providers with first-class streaming, tool-calling, and structured output support. Apache 2.0 licensed |
| verl | Training & Fine-tuning EcosystemFull Training Frameworks | — | — | TOOL | Volcano Engine Reinforcement Learning for LLMs with PPO, GRPO, REINFORCE++, DAPO (EuroSys 2025) |
| veScale (ByteDance) | Training & Fine-tuning EcosystemDistributed Training | — | — | TOOL | Hyperscale PyTorch distributed training with flexible FSDP implementation for LLMs and RL training at scale |
| Vespa | Retrieval-Augmented Generation (RAG) & KnowledgeVector Databases & Search Engines | — | — | TOOL | AI + Data platform with hybrid search (vector + keyword) and real-time indexing at scale. Battle-tested serving billions of queries daily |
| VibeVoice (Microsoft) | Open Foundation ModelsSpeech & Audio Models (TTS, STT, Music) | — | — | MODEL | Open-source frontier voice AI with expressive, longform conversational speech synthesis. 7B parameter TTS with streaming support |
| VILA (NVIDIA) | Open Foundation ModelsMultimodal Models (Vision + Language) | — | Apache-2.0 | MODEL | Family of state-of-the-art vision language models for diverse multimodal AI tasks across edge, data center, and cloud. Features NVILA 8B/15B with efficient training and deployment. Apache 2.0 licensed |
| vim-ai | Developer Tools & IntegrationsIDE Plugins & Extensions | — | MIT | TOOL | AI-powered code assistant for Vim and Neovim. Generate code, edit text, and have interactive conversations with GPT models. Supports custom roles, vision capabilities, and any OpenAI-compatible API. MIT licensed |
| vit-pytorch | Core Frameworks & LibrariesDeep Learning Frameworks | — | MIT | TOOL | Comprehensive Vision Transformer (ViT) implementations in PyTorch. Reference implementations of all major vision transformer variants including ViT, DeiT, Swin, and more. MIT licensed |
| vLLM | Inference Engines & ServingHigh-performance Serving & API Servers | — | — | TOOL | State-of-the-art serving engine with PagedAttention and continuous batching. Currently the fastest production-grade LLM server |
| vLLM Production Stack | Inference Engines & ServingHigh-performance Serving & API Servers | — | — | TOOL | Kubernetes-native production stack for vLLM inference. Automated deployment, autoscaling, and monitoring for enterprise-grade LLM serving. Built by the vLLM team for seamless integration |
| VLMEvalKit | Evaluation, Benchmarks & DatasetsBenchmark Suites | — | Apache-2.0 | BENCHMARK | Open-source evaluation toolkit for large multi-modality models (LMMs). Supports 220+ LMMs and 80+ benchmarks including MMMU, MathVista, and ChartQA. Powers the OpenVLM Leaderboard. Apache 2.0 licensed |
| Void Editor | Developer Tools & IntegrationsAI-Native IDEs & Development Environments | — | Apache-2.0 | TOOL | Open-source AI-native code editor forked from VS Code. Features agentic AI editing, inline code generation, and chat interface. Designed as a Cursor alternative with full control over your data. Apache 2.0 licensed |
| Volcano | MLOps / LLMOps & ProductionDeployment & Orchestration | — | Apache-2.0 | TOOL | Cloud-native batch scheduling system for compute-intensive workloads. CNCF incubating project with gang scheduling, job dependency management, and topology-aware scheduling for AI/ML and deep learning. Apache 2.0 licensed |
| VoltAgent | Agentic AI & Multi-Agent SystemsSingle-Agent Frameworks | — | — | AGENT | TypeScript-first AI agent engineering platform with memory, RAG, workflows, MCP integration, and voice support |
| VoxCPM | Open Foundation ModelsSpeech & Audio Models (TTS, STT, Music) | — | Apache-2.0 | MODEL | Tokenizer-free diffusion autoregressive TTS with 2B parameters. Supports 30+ languages with automatic detection, creative voice design from text descriptions, and high-fidelity voice cloning. Apache 2.0 licensed |
| Voxtral TTS (Mistral) | Open Foundation ModelsSpeech & Audio Models (TTS, STT, Music) | — | — | MODEL | 4B parameter state-of-the-art TTS with zero-shot voice cloning, 9-language support, and ~90ms time-to-first-audio for voice agents |
| Voyager (Spotify) | Retrieval-Augmented Generation (RAG) & KnowledgeVector Databases & Search Engines | — | Apache-2.0 | TOOL | Spotify's next-gen approximate nearest-neighbor search library for Python and Java. Up to 10x faster than Annoy with 4x less memory, designed for production use at billion-vector scale. Apache 2.0 licensed |
| Wan2.2 (Alibaba) | Generative Media ToolsVideo Generation | — | — | TOOL | Leading open Mixture-of-Experts text-to-video model |
| Weaviate | Retrieval-Augmented Generation (RAG) & KnowledgeVector Databases & Search Engines | — | — | TOOL | GraphQL-native vector search engine |
| WebArena | Evaluation, Benchmarks & DatasetsBenchmark Suites | — | MIT | BENCHMARK | Realistic web environment for building and evaluating autonomous agents. Self-hostable benchmark with 812 diverse web tasks across shopping, CMS, Reddit, GitLab, and more. ICLR 2024. MIT licensed |
| WebLLM | Inference Engines & ServingLocal / On-device Inference | — | — | TOOL | High-performance in-browser LLM inference engine. Runs models directly in the browser with WebGPU acceleration |
| Webots | Specialized DomainsAutonomous Driving & Robotics Simulators | — | Apache-2.0 | TOOL | Open-source multi-platform robot simulator providing a complete development environment for modeling, programming, and simulating robots, vehicles, and mechanical systems. Used in education, research, and industry. Apache 2.0 licensed |
| Weights & Biases Weave | MLOps / LLMOps & ProductionExperiment Tracking & Versioning | — | — | TOOL | Open-source tracing and experiment tracking |
| Whisper (OpenAI → community forks) | Open Foundation ModelsSpeech & Audio Models (TTS, STT, Music) | — | — | MODEL | The gold-standard open speech-to-text model. Massive community fine-tunes available |
| WhisperLive | Generative Media ToolsVideo Generation | — | MIT | TOOL | Nearly-live implementation of OpenAI's Whisper for real-time speech-to-text transcription. Supports faster-whisper, tensorrt, and openvino backends with WebSocket streaming. MIT licensed |
| WhisperSpeech | Open Foundation ModelsSpeech & Audio Models (TTS, STT, Music) | — | MIT | MODEL | Open source text-to-speech system built by inverting Whisper. High-quality voice cloning with zero-shot capabilities. MIT licensed |
| Willow | User Interfaces & Self-hosted PlatformsLocal AI Chat UIs & Personal Assistants | — | Apache-2.0 | TOOL | Open-source, local, self-hosted voice assistant that aims to be competitive with Amazon Echo and Google Home, with hardware support. Apache 2.0 licensed |
| windsurf.vim | Developer Tools & IntegrationsIDE Plugins & Extensions | — | MIT | TOOL | Free, ultrafast Copilot alternative for Vim and Neovim. AI-powered code completion with low latency and large context window. MIT licensed |
| XAI | AI Safety, Alignment & InterpretabilityInterpretability & Explainability | — | — | TOOL | eXplainability toolbox for machine learning with bias evaluation and production monitoring tools |
| xFormers | Core Frameworks & LibrariesModel Training & Optimization Utilities | — | — | TOOL | Optimized transformer building blocks and attention operators for PyTorch |
| XGBoost | Core Frameworks & LibrariesClassical ML & Gradient Boosting | — | — | TOOL | Scalable, high-performance gradient boosting library. Still dominates Kaggle and tabular competitions |
| XGrammar | Agentic AI & Multi-Agent SystemsPrompt Engineering & Structured Outputs | — | Apache-2.0 | AGENT | Fast, flexible and portable structured generation engine. Default backend for vLLM, SGLang, TensorRT-LLM, and MLC-LLM with flexible grammar support and zero-overhead mask generation. Apache 2.0 licensed |
| Xinference | Inference Engines & ServingHigh-performance Serving & API Servers | — | — | TOOL | Unified, production-ready inference API for LLMs, speech, and multimodal models. Drop-in GPT replacement with single-line code changes. Supports thousands of models with auto-batching and distributed inference |
| xLLM | Inference Engines & ServingAdditional Inference Engines | — | Apache-2.0 | TOOL | High-performance inference engine optimized for Chinese AI accelerators (Cambricon MLU, Hygon DCU, Huawei Ascend). Features service-engine decoupled architecture with elastic scheduling, PD disaggregation, and global KV cache management. Powers JD.com's core retail businesses. Apache 2.0 licensed |
| XTuner | Training & Fine-tuning EcosystemFull Training Frameworks | — | Apache-2.0 | TOOL | A next-generation training engine built for ultra-large MoE models with efficient QLoRA and full-parameter fine-tuning. Apache 2.0 licensed |
| ydata-profiling | Core Frameworks & LibrariesData Quality & Validation | — | MIT | TOOL | One line of code for comprehensive data quality profiling and exploratory data analysis. Generates detailed reports for Pandas and Spark DataFrames including statistics, correlations, missing values, and data quality alerts. MIT licensed |
| Z-Image (Tongyi) | Generative Media ToolsImage Generation & Editing | — | Apache-2.0 | TOOL | Powerful and efficient image generation model family with 6B parameters. Includes Z-Image-Turbo for sub-second inference and Z-Image-Omni-Base for both generation and editing. Strong bilingual text rendering and instruction adherence. Apache 2.0 licensed |
| Zarr | Core Frameworks & LibrariesData Processing & Manipulation | — | MIT | TOOL | Chunked, compressed, N-dimensional array storage. Scalable tensor data format optimized for cloud and parallel computing. MIT licensed |
| Zasper | Developer Tools & IntegrationsNotebooks & Interactive Computing | — | AGPL-3.0 | TOOL | High-performance IDE for Jupyter Notebooks built with Go. Up to 5x less CPU and 40x less RAM than JupyterLab. Implements Jupyter's wire protocol with massive concurrency support. AGPL-3.0 licensed |
| Zed | Developer Tools & IntegrationsAI-Native IDEs & Development Environments | — | GPL | TOOL | High-performance, multiplayer code editor with built-in AI features. From the creators of Atom and Tree-sitter. Native AI agentic editing with support for any LLM provider. GPL licensed |
| ZenML | MLOps / LLMOps & ProductionDeployment & Orchestration | — | — | TOOL | Pipeline and orchestration framework for taking ML and LLM systems from development to production |
| zvec | Retrieval-Augmented Generation (RAG) & KnowledgeVector Databases & Search Engines | — | Apache-2.0 | TOOL | Lightweight, lightning-fast, in-process vector database from Alibaba. Built on Proxima (Alibaba's battle-tested vector search engine) for production-grade, low-latency similarity search. Apache 2.0 licensed |