Reinforcement Learning

A community dedicated to discussions on reinforcement learning, a subdiscipline of machine learning that tackles sequential decision making problems.

Members

Posts

Active Today

Created

1 yr. ago

Sort

Reinforcement Learning @lemmy.ca
howrar @lemmy.ca
2mo ago

jackhopkins.github.io Factorio Learning Environment

0
Reinforcement Learning @lemmy.ca
howrar @lemmy.ca
2mo ago

www.acm.org Andrew Barto and Richard Sutton are the recipients of the 2024 ACM A.M. Turing Award for developing the conceptual and algorithmic foundations of reinforcement learning.
Andrew Barto and Richard Sutton are the recipients of the 2024 ACM A.M. Turing Award for developing the conceptual and algorithmic foundations of reinforcement learning. In a series of papers beginning in the 1980s, Barto and Sutton introduced the main ideas, constructed the mathematical foundations...

0
Reinforcement Learning @lemmy.ca
howrar @lemmy.ca
3mo ago

Open Sourcing π₀

www.physicalintelligence.company Open Sourcing π0
Physical Intelligence is bringing general-purpose AI into the physical world.

0
Reinforcement Learning @lemmy.ca
howrar @lemmy.ca
3mo ago

A Little Bit of Reinforcement Learning from Human Feedback -- Nathan Lambert

rlhfbook.com /book.pdf

https://bsky.app/profile/natolambert.bsky.social/post/3lh5jih226k2k
Anyone interested in learning about RLHF? This text isn't complete yet, but looks to be a pretty useful resource as is already.

0
Reinforcement Learning @lemmy.ca
howrar @lemmy.ca
5mo ago

Reinforcement Learning: An Overview

arxiv.org Reinforcement Learning: A Comprehensive Overview
This manuscript gives a big-picture, up-to-date overview of the field of (deep) reinforcement learning and sequential decision making, covering value-based method, policy-gradient methods, model-based methods, and various other topics (e.g., multi-agent RL, RL+LLMs, and RL+inference).

An overview of RL published just a few days ago. 144 pages of goodies covering everything from basic RL theory to modern deep RL algorithms and various related niches.
This manuscript gives a big-picture, up-to-date overview of the field of (deep) reinforcement learning and sequential decision making, covering value-based RL, policy-gradient methods, model-based methods, and various other topics (including a very brief discussion of RL+LLMs).

0
Reinforcement Learning @lemmy.ca
howrar @lemmy.ca
7mo ago

Keynotes from the 2024 Reinforcement Learning Conference

www.youtube.com /playlist
Recordings for the RLC keynote talks have been released.
Keynote speakers:
- David Silver
- Doina Precup (Not recorded)
- Peter Stone
- Finale Doshi-Velez
- Sergey Levine
- Emma Brunskill
- Andrew Barto
0
Reinforcement Learning @lemmy.ca
howrar @lemmy.ca
8mo ago

OpenAI: Learning to Reason with LLMs

openai.com Just a moment...

OpenAI just put out a blog post about a new model trained via RL (I'm assuming this isn't the usual RLHF) to perform chain of thought reasoning before giving the user its answer. As usual, there's very little detail about how this is accomplished so it's hard for me to get excited about it, but the rest of you might find this interesting.

0
Reinforcement Learning @lemmy.ca
howrar @lemmy.ca
1y ago

Introducing SIMA, a Scalable Instructable Multiworld Agent

deepmind.google A generalist AI agent for 3D virtual environments
Introducing SIMA, a Scalable Instructable Multiworld Agent

0

0 active users