Discussion about this post

Daniel Popescu / ⧉ Pluralisk

Couldn't agree more. Your insight on applying AlphaGo's RL approach to LLMs is really spot on. The grounding and agency argument feels like the key for next-gen models. Fantastic read!

Melfe Bulu

thx for more info about LLM with RL :)

