Sept. 26, 2023, midnight | schmidphilipp1995@gmail.com (Philipp Schmid)

philschmid blog www.philschmid.de

A benchmark of Llama 2 models of varying sizes across a range of Amazon EC2 instance types and load levels, measuring latency (ms per token) and throughput (tokens per second).
