TwoTail

What shall we analyze?

Recent Charts

Autonomy

Evals Last 30 days

Sandbox

Autonomy runs experiments here as it investigates. You can also start one manually.

New sandbox run

1Type

2Task

3Variants

4Evals

5Inputs

6Review

What do you want to test?

Datasets

Static collections of (input, output, eval) rows. Used for golden sets, calibration, annotation, and sandbox runs. Autonomy builds these as it works; you can also create one manually.

Loading…

Vocabulary

Help the analyst understand your agent's structure and goals

Strategy

Describe your agent's goals and what you're optimizing for

Terminology

Define business terms, KPIs, and domain-specific concepts

Data Structure

Span hierarchy and metadata fields (auto-generated from your traces)

Agent Change Log

Record dated changes to your agent (model swaps, prompt updates, deploys) so analysis can tie trends to causes

What shall we analyze?

New project

Dashboard

Chart

Your API Key

API Endpoint

Coding Agent Prompt

Quick Test

Project

Daily Update

Task

Eval Detail

New sandbox run

What do you want to test?

Which task do you want to test?

Define your variants

Which evals?

Which inputs should we run against?

Review and launch

Sandbox run

Tradeoffs

New dataset

Add row

Strategy

Terminology

Data Structure

Agent Change Log

API Keys

Create New Key

Key Created Successfully

Your Keys

Team Settings

Invite Team Members

Invite Created

Team Members

Pending Invites

What shall we analyze?

New project

Dashboard

Chart

Add to Dashboard

Your API Key

API Endpoint

Coding Agent Prompt⚙

Quick Test

Project

Daily Update

Task

Eval Detail

New sandbox run

What do you want to test?

Which task do you want to test?

Define your variants

Which evals?

Which inputs should we run against?

Review and launch

Sandbox run

Tradeoffs

New dataset

Add row

Strategy

Terminology

Data Structure

Agent Change Log

API Keys

Create New Key

Key Created Successfully

Your Keys

Team Settings

Invite Team Members

Invite Created

Team Members

Pending Invites

Welcome to TwoTail

Team Invitation

Coding Agent Prompt