AI Developer Tools & Libraries

Many of these tools are open source and also appear in our AI Tools Directory. You can explore more live projects on the GitHub Trending page.

MLflow

Open source platform for the ML lifecycle including experimentation, reproducibility, and deployment

Kubeflow

Machine learning toolkit for Kubernetes

DVC (Data Version Control)

Git for data and models - version control for ML projects

Metaflow

Netflix's framework for real-life data science and ML engineering

ZenML

Extensible, open-source MLOps framework for production-ready ML pipelines

Flyte

Kubernetes-native workflow automation platform for complex, mission-critical data and ML processes

Prefect

Modern workflow orchestration for data and ML pipelines

ClearML

End-to-end MLOps suite for managing experiments, orchestrating workflows, and deploying models

LangChain

Framework for developing applications powered by language models

LlamaIndex

Data framework for LLM applications to ingest, structure, and access data

Haystack

End-to-end NLP framework for building production-ready applications

Semantic Kernel

Microsoft's SDK for integrating AI into apps with LLMs

AutoGen

Microsoft framework for building multi-agent AI systems

CrewAI

Framework for orchestrating role-playing, autonomous AI agents

Instructor

Structured outputs for LLMs with validation using Pydantic

Guidance

Microsoft's programming paradigm for controlling LLMs

PromptTools

Testing and experimentation library for LLMs and vector databases

PromptLayer

Platform for prompt engineering with logging, templates, and analytics

LangSmith

Platform for debugging, testing, and monitoring LLM applications

Weights & Biases Prompts

Tool for tracking and versioning prompts with W&B

DSPy

Framework for algorithmically optimizing LM prompts and weights

Promptfoo

Test and evaluate LLM output quality

Guardrails AI

Add structure, type and quality guarantees to LLM outputs

DeepEval

Simple-to-use LLM evaluation framework with 14+ metrics

Giskard

Testing & evaluation library for ML models and LLMs

Ragas

Framework for evaluating RAG (Retrieval Augmented Generation) pipelines

TruLens

Instrumentation and evaluation tools for LLM apps

Evidently AI

Open-source ML observability and testing platform

Great Expectations

Data quality and validation framework

Pytest-LLM

Pytest plugin for testing LLM-powered applications

GitHub Actions for ML

CI/CD workflows for machine learning projects

CML (Continuous Machine Learning)

CI/CD for ML projects by Iterative.ai

DVC Pipelines

Define and execute ML pipelines with automatic versioning

Argo Workflows

Container-native workflow engine for orchestrating parallel jobs

Airflow

Platform to programmatically author, schedule and monitor workflows

Dagster

Data orchestration platform for developing, testing, and monitoring data assets

BentoML

Unified framework for building, shipping, and scaling ML services

Ray Serve

Scalable model serving library built on Ray

TorchServe

PyTorch model serving framework

TensorFlow Serving

Flexible, high-performance serving system for ML models

Triton Inference Server

NVIDIA's open-source inference serving software

vLLM

Fast and easy-to-use library for LLM inference and serving

Ollama

Run large language models locally

Modal

Serverless platform for running ML models and data pipelines

Langfuse

Open-source LLM engineering platform for debugging and analytics

Arize AI

ML observability platform for monitoring and troubleshooting

Weights & Biases

ML platform for experiment tracking, dataset versioning, and model management

Neptune.ai

Metadata store for MLOps with experiment tracking and model registry

Comet

Platform for tracking, comparing, and optimizing ML experiments

WhyLabs

AI observability platform for monitoring data and ML models

Phoenix

Open-source observability library for LLMs and embeddings

Pinecone

Managed vector database for ML applications

Weaviate

Open-source vector database with hybrid search capabilities

Qdrant

Vector similarity search engine with extended filtering support

Milvus

Open-source vector database built for scalable similarity search

Chroma

Open-source embedding database for LLM apps

LanceDB

Developer-friendly, serverless vector database

pgvector

Vector similarity search for PostgreSQL
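
To make "vector similarity search" concrete, here is a minimal brute-force sketch in plain Python. It is not how pgvector or any of the databases above are implemented (they rely on indexes and run inside their own engines); the function names and the toy `docs` vectors are assumptions made up for illustration.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def top_k(query, items, k=2):
    """Return the names of the k stored vectors most similar to the query."""
    scored = [(cosine_similarity(query, vec), name) for name, vec in items.items()]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [name for _, name in scored[:k]]


# Hypothetical 3-dimensional "embeddings"; real embeddings have hundreds of dimensions.
docs = {
    "cat": [1.0, 0.0, 0.0],
    "kitten": [0.9, 0.1, 0.0],
    "car": [0.0, 1.0, 0.0],
}

print(top_k([1.0, 0.0, 0.0], docs, k=2))
```

A linear scan like this compares the query against every stored vector, which is why dedicated vector databases use approximate nearest-neighbor indexes once collections grow large.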

dbt (Data Build Tool)

Transform data in your warehouse using SQL

Apache Spark

Unified analytics engine for large-scale data processing

Polars

Fast DataFrame library written in Rust, with Python bindings

Delta Lake

Open-source storage framework for building lakehouse architecture

Apache Iceberg

High-performance table format for huge analytic datasets

Feast

Open-source feature store for machine learning

Tecton

Enterprise feature platform for ML

Axolotl

Tool for streamlining fine-tuning of various AI models

Unsloth

Fine-tune Llama 3.2, Mistral, Phi, and Gemma models; the project claims 2-5x faster training with 70% less memory

PEFT

Parameter-Efficient Fine-Tuning methods by Hugging Face

LoRA

Low-Rank Adaptation for efficient LLM fine-tuning
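
As a rough illustration of why low-rank adaptation is efficient, the sketch below applies the LoRA update W = W0 + (alpha / r) * B A with plain Python lists and compares trainable parameter counts. The helper names are hypothetical and this is not the PEFT API; it only assumes the standard formulation in which the frozen weight W0 is d x k, B is d x r, and A is r x k.

```python
def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def lora_effective_weight(w0, b, a, alpha, r):
    """Return W0 + (alpha / r) * (B @ A); W0 stays frozen, only B and A are trained."""
    scale = alpha / r
    delta = matmul(b, a)
    return [
        [w + scale * d for w, d in zip(w_row, d_row)]
        for w_row, d_row in zip(w0, delta)
    ]


def trainable_params(d, k, r):
    """Compare parameter counts: full fine-tuning of a d x k matrix vs. rank-r LoRA."""
    return {"full": d * k, "lora": r * (d + k)}


# Tiny hand-checkable example: a 2 x 2 weight with a rank-1 update.
w0 = [[1, 0], [0, 1]]
b = [[1], [2]]  # d x r
a = [[3, 4]]    # r x k
print(lora_effective_weight(w0, b, a, alpha=2, r=1))

# Illustrative sizes only: a 4096 x 4096 projection adapted at rank 8.
print(trainable_params(4096, 4096, 8))
```

The saving comes from training r * (d + k) values instead of d * k, which is a small fraction when r is much smaller than d and k.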

DeepSpeed

Deep learning optimization library by Microsoft

LitGPT (formerly Lit-GPT)

Hackable implementation of state-of-the-art LLMs

TRL (Transformer Reinforcement Learning)

Train transformer language models with RL