5 Comments

Any idea which paper this is?

“There's some work where they'll simulate little basic agents and see if the representations they learn map to the tools they can use and the inputs they should have.”

Some typos from the first 2 hours: "So there's a direct path and an indirect path. and, and so the model can pick up whatever information it wants and then add that back in."

"But we have a lot more work to do on that. surprise to the Twitter guy,"

"And there's a verifier there too, right? There's the real world. You might generate a theory about the gods causing the storms, And then someone else finds cases where that isn't true."

I enjoyed it, nice looser conversation. I didn't understand everything, but hey, you learn by stretching into new territory, so I appreciate the advanced discussion! 💚 🥃

Hiring discussion was spot on.

Reminds me that my blog post on learning RLHF in one week may still be the top Google result 🤣
