Yuxi’s Substack
Subscribe
Sign in
Home
Archive
About
Latest
Top
Discussions
Reflection 2024, Guesstimation 2025
1. RL returns.
Dec 31, 2024
•
Yuxi Li
Share this post
Yuxi’s Substack
Reflection 2024, Guesstimation 2025
Copy link
Facebook
Email
Notes
More
August 2024
LLMs: Agent? AI? Computer? Business?
Since the launch of ChatGPT in November 2022, LLMs has been a hot topic, among people from both industry and academia, as well as people from basically…
Aug 26, 2024
•
Yuxi Li
2
Share this post
Yuxi’s Substack
LLMs: Agent? AI? Computer? Business?
Copy link
Facebook
Email
Notes
More
Seeking reliable signal
Seeking signal, especially reliable signal, and avoiding noise, is universal.
Aug 12, 2024
•
Yuxi Li
3
Share this post
Yuxi’s Substack
Seeking reliable signal
Copy link
Facebook
Email
Notes
More
November 2023
Study Material for Reinforcement Learning
David Silver, Reinforcement Learning Course (classic)
Nov 24, 2023
•
Yuxi Li
3
Share this post
Yuxi’s Substack
Study Material for Reinforcement Learning
Copy link
Facebook
Email
Notes
More
Q*, Reinforcement Learning and Search
with applications in LLMs
Nov 24, 2023
•
Yuxi Li
5
Share this post
Yuxi’s Substack
Q*, Reinforcement Learning and Search
Copy link
Facebook
Email
Notes
More
Will synthetic data help?
A short answer is: it depends.
Nov 24, 2023
•
Yuxi Li
2
Share this post
Yuxi’s Substack
Will synthetic data help?
Copy link
Facebook
Email
Notes
More
Levels of AGI & Autonomy
Deepmind published a paper discussing levels of AGI and autonomy.
Nov 14, 2023
•
Yuxi Li
2
Share this post
Yuxi’s Substack
Levels of AGI & Autonomy
Copy link
Facebook
Email
Notes
More
How would Deepmind Gemini work?
Deepmind will launch Gemini soon, reportedly.
Nov 8, 2023
•
Yuxi Li
Share this post
Yuxi’s Substack
How would Deepmind Gemini work?
Copy link
Facebook
Email
Notes
More
October 2023
Over-claim then Correct: A New Norm of Research in the Era of LLMs
In the era of LLMs, it is a norm that people make many, bold, general claims.
Oct 24, 2023
•
Yuxi Li
2
Share this post
Yuxi’s Substack
Over-claim then Correct: A New Norm of Research in the Era of LLMs
Copy link
Facebook
Email
Notes
More
September 2023
Blockchains Require Dramatic Innovations to Prosper
Blockchains calls for killer apps
Sep 25, 2023
•
Yuxi Li
1
Share this post
Yuxi’s Substack
Blockchains Require Dramatic Innovations to Prosper
Copy link
Facebook
Email
Notes
More
Human alignment is very hard
Should physicist J.
Sep 4, 2023
•
Yuxi Li
1
Share this post
Yuxi’s Substack
Human alignment is very hard
Copy link
Facebook
Email
Notes
More
August 2023
Agent: What, Why, How.
Agent is a core concept in AI.
Aug 31, 2023
•
Yuxi Li
4
Share this post
Yuxi’s Substack
Agent: What, Why, How.
Copy link
Facebook
Email
Notes
More
Share
Copy link
Facebook
Email
Notes
More
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts