Feb. 16, 2024, 5:43 a.m. | Leonidas Gee, Andrea Zugarini, Leonardo Rigutini, Paolo Torroni

cs.LG updates on arXiv.org arxiv.org

arXiv:2402.09977v1 Announce Type: cross
Abstract: Real-world business applications require a trade-off between language model performance and size. We propose a new method for model compression that relies on vocabulary transfer. We evaluate the method on various vertical domains and downstream tasks. Our results indicate that vocabulary transfer can be effectively used in combination with other compression techniques, yielding a significant reduction in model size and inference time while marginally compromising on performance.

abstract applications arxiv business business applications combination compression cs.ai cs.cl cs.lg domains language language model performance tasks trade trade-off transfer type world

Software Engineer for AI Training Data (School Specific)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Python)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Tier 2)

@ G2i Inc | Remote

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

Lead Developer (AI)

@ Cere Network | San Francisco, US