all AI news
Topic: residual
Optimal time sampling in physics-informed neural networks
2 days, 1 hour ago |
arxiv.org
[D] Why transformers are not trained layer-wise?
6 days, 15 hours ago |
www.reddit.com
CLEANing Cygnus A deep and fast with R2D2
1 week, 1 day ago |
arxiv.org
Rewiring the Transformer with Depth-Wise LSTMs
3 weeks, 6 days ago |
arxiv.org
Score Operator Newton transport
1 month, 2 weeks ago |
arxiv.org
DiffRed: Dimensionality Reduction guided by stable rank
1 month, 2 weeks ago |
arxiv.org
Stacking as Accelerated Gradient Descent
1 month, 3 weeks ago |
arxiv.org
Generalizing Cooperative Eco-driving via Multi-residual Task Learning
1 month, 3 weeks ago |
arxiv.org
Residual Multi-Fidelity Neural Network Computing
1 month, 3 weeks ago |
arxiv.org
Towards Provable Log Density Policy Gradient
1 month, 3 weeks ago |
arxiv.org
[D] Why do GLUs (Gated Linear Units) work?
1 month, 3 weeks ago |
www.reddit.com
[D] Why transformers are not trained layer-wise?
6 days, 15 hours ago |
www.reddit.com
Items published with this topic over the last 90 days.
Latest
Optimal time sampling in physics-informed neural networks
2 days, 1 hour ago |
arxiv.org
[D] Why transformers are not trained layer-wise?
6 days, 15 hours ago |
www.reddit.com
CLEANing Cygnus A deep and fast with R2D2
1 week, 1 day ago |
arxiv.org
Rewiring the Transformer with Depth-Wise LSTMs
3 weeks, 6 days ago |
arxiv.org
Score Operator Newton transport
1 month, 2 weeks ago |
arxiv.org
DiffRed: Dimensionality Reduction guided by stable rank
1 month, 2 weeks ago |
arxiv.org
Stacking as Accelerated Gradient Descent
1 month, 3 weeks ago |
arxiv.org
Generalizing Cooperative Eco-driving via Multi-residual Task Learning
1 month, 3 weeks ago |
arxiv.org
Residual Multi-Fidelity Neural Network Computing
1 month, 3 weeks ago |
arxiv.org
Towards Provable Log Density Policy Gradient
1 month, 3 weeks ago |
arxiv.org
[D] Why do GLUs (Gated Linear Units) work?
1 month, 3 weeks ago |
www.reddit.com
Topic trend (last 90 days)
Top (last 7 days)
[D] Why transformers are not trained layer-wise?
6 days, 15 hours ago |
www.reddit.com
Jobs in AI, ML, Big Data
Data Architect
@ University of Texas at Austin | Austin, TX
Data ETL Engineer
@ University of Texas at Austin | Austin, TX
Lead GNSS Data Scientist
@ Lurra Systems | Melbourne
Senior Machine Learning Engineer (MLOps)
@ Promaton | Remote, Europe
Risk Management - Machine Learning and Model Delivery Services, Product Associate - Senior Associate-
@ JPMorgan Chase & Co. | Wilmington, DE, United States
Senior ML Engineer (Speech/ASR)
@ ObserveAI | Bengaluru