Waza Orig
failed
Cli Tool
web_app / go · medium
560
Files
87,262
LOC
2
Frameworks
12
Languages
Pipeline State
completedRun ID
#407264Phase
doneProgress
1%Started
Finished
2026-04-13 01:31:02LLM tokens
0Previous runs
| # | Status | Phase | Started | Finished | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Repobility · open methodology · https://repobility.com/research/ | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| #186516 | failed | EXTENDED_ANALYSIS | 2026-04-10 20:01:24 | — | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Pipeline Metadata
Stage
CatalogedDecision
proceedNovelty
65.60Framework unique
—Isolation
—Last stage change
2026-05-10 03:35:02Deduplication group #1916883
Member of a group with 3 similar repo(s) — canonical #184690 view group →
Repobility · code-quality intelligence platform · https://repobility.com
AI Prompt
Create a command-line interface (CLI) tool written in Go that evaluates AI agent skills. The tool should allow users to initialize a project workspace, create new skills, and run evaluations defined in YAML files. Key features needed include running benchmarks, comparing results across different models using JSON files, and suggesting token optimizations for skills. It should also support checking skill readiness and suggesting evaluation suites.
go cli ai-agent evaluation benchmark command-line tooling
Generated by gemma4:latest
Catalog Information
Create a command-line interface (CLI) tool written in Go that evaluates AI agent skills. The tool should allow users to initialize a project workspace, create new skills, and run evaluations defined in YAML files. Key features needed include running benchmarks, comparing results across different models using JSON files, and suggesting token optimizations for skills. It should also support checking skill readiness and suggesting evaluation suites.
Tags
go cli ai-agent evaluation benchmark command-line tooling
Languages
Frameworks
Astro Vite
Symbols
function666
method301
struct286
variable235
constant164
interface36
type_alias24
Threat Findings
Repobility · severity-and-effort ranking · https://repobility.com
34
Total Threats
2
Critical
19
High
Embed Badge
Add to your README:
