The Value of Failure Taxonomy
Detecting that an AI system is failing is the easier problem. A failure taxonomy maps observable loss signals to the system layer …
Read More
Testing gear on mountains. Testing ideas with mountains of data.
Gear reviews and data projects from Boulder, Colorado
Detecting that an AI system is failing is the easier problem. A failure taxonomy maps observable loss signals to the system layer …
Read MoreWhen GPT-4 launched in March 2023, the topic mix of real user prompts shifted in the exact direction the model quality gaps would predict: …
Read MoreCalifornia Denti-Cal records show Anaheim's payment intensity rose after the March 2015 ownership transition while every peer office fell or …
Read MoreThe model quality framework originally relied on scalar ratings and active days. By replacing those inputs with ARC trajectory scores and …
Read MoreField-tested across Bear Peak, Fern Canyon, West Ridge, and Bear Canyon — a vest built for short mountain runs and the lightest carbon poles …
Read Full Review
Using synthetic data and a causal framework, I modeled why flat-rate Claude subscriptions likely broke down for heavy third-party OAuth …
Read More