Now Supporting NVIDIA DGX™ Spark

Latest Features Available

Dual-system comparison, smarter user support agent, support for benchmarking your own inference endpoint, and more.

Versus Workspace

Compare two systems side by side with live GPU, throughput, and latency charts.

User Support Agent

Chat-based project management with auto-generated analysis, summaries, and debugging.

Bring Your Own Endpoint (BYOE)

Benchmark your own inference endpoints across concurrency levels, prompt lengths, and accuracy datasets.

Fully Automated AI Performance Benchmarking

Configure unlimited combinations of models, software, hardware, and hyperparameters in seconds.

Parameters

Configure and test across multiple dimensions simultaneously.

Concurrency Levels
Token Lengths
Request Rates
Precision Modes

Chips

Benchmark across architectures.

NVIDIA DatacenterAMD InstinctIntel Gaudi 3AMD EPYCIntel XeonRTX GPUsIntel Arc

Models

Test the latest foundation models.

GPT-OSSDeepSeekGemmaPhiQwenMistralLlama 4+

Real-Time Metrics

Track performance across every dimension.

Throughput

tok/s

Latency

TTFT/TPOT

Power Usage

Watts

Efficiency

tok/W

AI-Powered

AI-Powered Analysis with Creator

Leverage AI-powered analysis to automatically generate insights, identify bottlenecks, and receive optimization recommendations.

  • Automated report generation
  • Performance anomaly detection
  • Optimization suggestions
  • Natural language queries
Try Creator
AI-Powered Creator
Hardware Sizer

Hardware Planning

Hardware Sizer Right-Size Your AI Infrastructure

Plan GPU clusters, estimate costs, and optimize hardware selection for your AI workloads.

GPU Selection
Cluster Sizing
Cost Analysis
Performance Estimates

Real-Time Monitoring

Performance Visualization with Pulse

Real-time visualization of performance metrics across your benchmarking runs.

Pulse Performance Visualization