Yuxi’s Substack

Sitemap - 2023 - Yuxi’s Substack

Study Material for Reinforcement Learning

Q*, Reinforcement Learning and Search

Will synthetic data help?

Levels of AGI & Autonomy

How would Deepmind Gemini work?

Over-claim then Correct: A New Norm of Research in the Era of LLMs

Blockchains Require Dramatic Innovations to Prosper

Human alignment is very hard

Agent: What, Why, How.

AI is still (very) vulnerable

RL(HF) Helps LMs

Autonomous agent is a BIG bubble

Ground-truth-in-the-loop

AGI is a wrong goal

Iterative improvements from feedback for language models

Reinforcement learning is all you need, for next generation language models.

Where is the boundary for large language models?

Will AGI Emerge from Large Language Models?

AI Stores

Coming soon

© 2025 Yuxi Li