Encord
NEWEncord is a computer vision data management platform with AI-assisted labeling, active learning, and model evaluation. Reduces labeling…
Superannotate
NEWSuperAnnotate is an end-to-end AI data platform combining annotation tools, quality automation, and model evaluation. Enterprise teams manage…
Kili Technology
NEWKili Technology is a training data platform with consensus labeling, automated QA, and workforce management. Builds high-quality labeled…
DeepEval
NEWDeepEval is an open-source LLM evaluation framework with 14+ evaluation metrics including hallucination, answer relevancy, and faithfulness. pytest-style…
Portkey AI
NEWPortkey is an AI gateway providing unified access to 200+ LLMs with built-in observability, caching, and fallbacks. Production-grade…
Unify AI
NEWUnify automatically routes LLM requests to the cheapest or fastest provider based on your optimization criteria. Benchmark any…
Eden AI
NEWEden AI provides a unified API for 100+ AI models across text, image, audio, and video. Test and…
Agenta
NEWAgenta is an open-source LLMOps platform for prompt management, evaluation, and deployment. Teams collaborate on prompts, run systematic…
HoneyHive
NEWHoneyHive is an AI evaluation and observability platform for teams building LLM applications. Dataset management, automated evaluations, and…
Mirascope
NEWMirascope is a Python toolkit for building LLM applications with clean abstractions for prompts, calls, and extractions. Type-safe…
PromptLayer
NEWPromptLayer is a platform for tracking, managing, and evaluating LLM prompts in production. Log every prompt and completion,…
Langfuse
NEWLangfuse is an open-source LLM engineering platform for observability, testing, and prompt management. Debug production AI issues, evaluate…