Discussion about this post

Aaron Tay's avatar

Moral of the story: keep the prompts that fail and retry them every 3-6 months on newer models. You will be surprised by the progress. Some of it could be "data leakage" or training that targets your use case, but if the prompt is obscure enough, that is unlikely.

Paul Cook's avatar

Fascinating work as always! But Step Brothers is a modern classic. Give it a chance!

