AI assistants are far from flawless, failing critical structured-output tasks ...