June 26, 2022, 6:11 a.m. | /u/RodObr

Machine Learning www.reddit.com

Let's say we have a time series problem where we are trying to use past information to predict future inputs. Like stock prices, or heart rates, or a language model that receives one word at a time.

In theory you would want each output at t to contain the maximum amount of predictive information about label t+1.

Let's say you attach a second network to this RNN, which tries to predict hidden state t+1 from hidden state t and add …

machinelearning modelling

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Senior Business Intelligence Developer / Analyst

@ Transamerica | Work From Home, USA

Data Analyst (All Levels)

@ Noblis | Bethesda, MD, United States