Discussion about this post

User's avatar
aholan@poynter.org's avatar

Great post. I am very interested in thinking through how we can do more systematic testing of hallucinations and sourcing.

Expand full comment
Stephen Fitzpatrick's avatar

Another superb post. I just don't think many folks are testing these models in the way that you are, but I would think that someone, somewhere should be doing this. I've used your prompt in Claude multiple times and the results are fantastic. What do you make of the new Google AI Search mode? Ridiculously primitive compared to what you're doing, but do you think it will continue to get better and better? As you say, it would seem like many of these problems should be fixable. Nice work!

Expand full comment
5 more comments...

No posts