Think Before You Act: Unified Policy for Interleaving Language Reasoning with Actions. (arXiv:2304.11063v1 [cs.CL])
cs.CL updates on arXiv.org
The success of transformer models trained with a language modeling objective
brings a promising opportunity to the reinforcement learning framework.
Decision Transformer is a step in this direction, showing how to train
transformers with a similar next-step prediction objective on offline data.
Another important development in this area is the recent emergence of
large-scale datasets collected from the internet, such as those composed of
captioned tutorial videos in which people talk about what they are doing. To
take advantage …