BeClaude
Industry2026-04-27

Show HN: OSS Agent I built topped the TerminalBench on Gemini-3-flash-preview

Source: Hacker News

Scored 65.2% vs google's official 47.8%, and the existing top closed source model Junie CLI's 64.3%.Since there are a lot of reports of deliberate cheating on TerminalBench 2.0 lately (https://debugml.github.io/cheating-agents/), I would like to also clarify a few...

hacker-newsgeminiagents