Tag: llm
All the articles with the tag "llm".
-
sdef2md: Turn any macOS app's scripting API into documentation and MCP tools
hrbrmstrA Go CLI that converts macOS .sdef scripting definitions into clean Markdown, paired with a skill that generates complete Go MCP servers from the generated reference — bridging any scriptable app into LLM agents.
-
Stop trusting LLM benchmarks
hrbrmstrEight major AI benchmarks can be gamed to near-perfect scores without solving tasks. Berkeley researchers show the scoring harnesses were never secure — and scores already inflated in the wild.