How Users Vote With Their Prompts
When GPT-4 launched in March 2023, the topic mix of real user prompts shifted in the exact direction the model quality gaps would predict: …
Read More
Testing gear on mountains. Testing ideas with mountains of data.
Gear reviews and data projects from Boulder, Colorado
When GPT-4 launched in March 2023, the topic mix of real user prompts shifted in the exact direction the model quality gaps would predict: …
Read MoreCalifornia Denti-Cal records show Anaheim's payment intensity rose after the March 2015 ownership transition while every peer office fell or …
Read MoreThe model quality framework originally relied on scalar ratings and active days. By replacing those inputs with ARC trajectory scores and …
Read MoreField-tested across Bear Peak, Fern Canyon, West Ridge, and Bear Canyon — a vest built for short mountain runs and the lightest carbon poles …
Read Full Review
Using synthetic data and a causal framework, I modeled why flat-rate Claude subscriptions likely broke down for heavy third-party OAuth …
Read More
XFlow cushioning and a wider platform, tested across four terrain types from the Cragmoor trailhead to the summit of Bear Peak.
Read Full Review