Reports are organized as `{model}/{suite}`.

| Model | Suite | Link |
|---|---|---|
| gpt-4.1-nano | FullRegressionSuite | Open |
| gpt-4.1-nano | SmokeTests | Open |
| gpt-5-nano | FullRegressionSuite | Open |
| gpt-5-nano | SmokeTests | Open |
| us.amazon.nova-2-lite-v1_0 | FullRegressionSuite | Open |
| us.amazon.nova-2-lite-v1_0 | SmokeTests | Open |
| us.anthropic.claude-3-5-haiku-20241022-v1_0 | FullRegressionSuite | Open |
| us.anthropic.claude-3-5-haiku-20241022-v1_0 | SmokeTests | Open |
| us.meta.llama4-scout-17b-instruct-v1_0 | FullRegressionSuite | Open |
| us.meta.llama4-scout-17b-instruct-v1_0 | SmokeTests | Open |
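
As a minimal sketch of the `{model}/{suite}` layout, the relative location of a report can be derived from the two values in each table row. The helper name `report_path` is hypothetical and not part of any tooling described here. The root directory or URL the path is resolved against is not specified in this index, so the sketch returns only the relative part.

```python
def report_path(model: str, suite: str) -> str:
    """Build the relative report location following the {model}/{suite} layout.

    Hypothetical helper for illustration; the report root is an unknown
    and is deliberately left out.
    """
    if not model or not suite:
        raise ValueError("model and suite must both be non-empty")
    return f"{model}/{suite}"


# Example using one row from the table above.
print(report_path("gpt-5-nano", "SmokeTests"))
```

The model identifier is used verbatim, including dots and underscores, so a row such as `us.amazon.nova-2-lite-v1_0` with `FullRegressionSuite` maps to `us.amazon.nova-2-lite-v1_0/FullRegressionSuite`.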