Best evaluation Tools
-
Terracotta
Terracotta is a platform that allows users to experiment with Large Language Models (LLMs) quickly and intuitively. It enables users to manage, fine-tune, and evaluate multiple LLM models in one place. Users can securely store their data, fine-tune models for classification and text generation, and compare models qualitatively and quantitatively.
-
scite
scite is a platform that helps researchers discover and understand research articles by showing how they have been cited. It allows users to read the context of citation and understand if it provides supporting or contrasting evidence for the cited claim. With over 1.2 billion Smart Citations, researchers can find expert analyses and opinions on all topics.
-
LLMonitor
Open source monitoring and production toolkit for AI apps