Learning to Route by Task for Efficient Inference | allainews.com

Jan. 14, 2022, 7:10 p.m. | Google AI (noreply@blogger.com)

Google AI Blog ai.googleblog.com

Posted by Sneha Kudugunta, Research Software Engineer and Orhan Firat, Research Scientist, Google Research

Scaling large language models has resulted in significant quality improvements natural language understanding (T5), generation (GPT-3) and multilingual neural machine translation (M4). One common approach to building a larger model is to increase the depth (number of layers) and width (layer dimensionality), simply enlarging existing dimensions of the network. Such dense models take an input sequence (divided into smaller components, …

deep learning emnlp google translate learning natural language processing route

More from ai.googleblog.com / Google AI Blog

Generative AI to quantify uncertainty in weather forecasting 4 weeks ago | ai.googleblog.com

climate decisions engineer example +17

AutoBNN: Probabilistic time series forecasting with compositional bayesian neural networks 4 weeks ago | ai.googleblog.com

bayesian data economic engineer +23

Computer-aided diagnosis for lung cancer screening 1 month ago | ai.googleblog.com

cancer cancer screening computer diagnosis +16

Using AI to expand global access to reliable flood forecasts 1 month ago | ai.googleblog.com

billion disaster engineering environment +13

ScreenAI: A visual language model for UI and visually-situated language understanding 1 month ago | ai.googleblog.com

charts communication design diagrams +24

SCIN: A new resource for representative dermatology images 1 month, 1 week ago | ai.googleblog.com

crowd-sourcing dataset datasets dermatology +14

MELON: Reconstructing 3D objects from images with unknown poses 1 month, 1 week ago | ai.googleblog.com

3d objects capacity computer vision engineer +16

HEAL: A framework for health equity assessment of machine learning performance 1 month, 1 week ago | ai.googleblog.com

assessment clinical core differences +17

Cappy: Outperforming and boosting large multi-task language models with a small scorer 1 month, 1 week ago | ai.googleblog.com

boosting engineers framework google +25

Data Architect

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

View on ai-jobs.net

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

View on ai-jobs.net

Machine Learning Engineer (m/f/d)

@ StepStone Group | Düsseldorf, Germany

View on ai-jobs.net

2024 GDIA AI/ML Scientist - Supplemental

@ Ford Motor Company | United States

View on ai-jobs.net