Robotics Policy Optimization on 100 drones (game theory) | allainews.com

Aug. 14, 2023, noon | code_your_own_AI

code_your_own_AI www.youtube.com

Two simple examples to optimize reward functions (transformer based) for RL of a fleet of taxis in New York (learning from their environment interactions) and Reinforcement Learning (RL multi-agents) for swarm intelligence of 100 drones exploring Jupiter's stormy atmosphere.

Open Problems and Fundamental Limitations of
Reinforcement Learning from Human Feedback
https://arxiv.org/pdf/2307.15217.pdf

#ai
#reinforcementlearning
#datascience

agents atmosphere drones environment examples feedback functions game game theory human human feedback intelligence interactions jupiter limitations optimization policy reinforcement reinforcement learning reinforcementlearning robotics simple theory transformer

More from www.youtube.com / code_your_own_AI

480B LLM as 128x4B MoE? WHY? 1 day ago | www.youtube.com

architecture architectures causal comparison +15

No more Fine-Tuning: Unsupervised ICL+ 2 days, 12 hours ago | www.youtube.com

advanced autonomous context deepmind +17

NEW Phi-3 mini 3.8B LLM for Your PHONE: 1st TEST 3 days, 2 hours ago | www.youtube.com

datasets llama llama 3 llm +9

BEST LLMs for Coding, Long Context, Overall Perform 4 days ago | www.youtube.com

april benchmark benchmarks coding +12

Next-Gen AI: RecurrentGemma (Long Context Length) 5 days, 22 hours ago | www.youtube.com

architecture attention brand complexity +17

Gemini 1.5 PRO vs Lllama3-70B-Instruct: TEST 6 days, 4 hours ago | www.youtube.com

70b causal gemini gemini 1.5 +8

Llama 3 70B Instruct: A Logical Reasoning Test #ai 1 week ago | www.youtube.com

70b causal context llama +11

Mighty New TransformerFAM (Feedback Attention Mem) 1 week, 2 days ago | www.youtube.com

ai research architecture attention block +11

INFINI Attention explained: 1 Mio Context Length 1 week, 3 days ago | www.youtube.com

attention context explained format +8

Data Architect

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

View on ai-jobs.net

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

View on ai-jobs.net

Reporting & Data Analytics Lead (Sizewell C)

@ EDF | London, GB

View on ai-jobs.net

Data Analyst

@ Notable | San Mateo, CA

View on ai-jobs.net