March 5, 2024, 2:52 p.m. | Si Sun, Hanqing Zhang, Zhiyuan Liu, Jie Bao, Dawei Song

cs.CL updates on arXiv.org arxiv.org

arXiv:2403.01999v1 Announce Type: new
Abstract: Dense Retrieval (DR) is now considered as a promising tool to enhance the memorization capacity of Large Language Models (LLM) such as GPT3 and GPT-4 by incorporating external memories. However, due to the paradigm discrepancy between text generation of LLM and DR, it is still an open challenge to integrate the retrieval and generation tasks in a shared LLM. In this paper, we propose an efficient LLM-Oriented Retrieval Tuner, namely LMORT, which decouples DR capacity …

abstract arxiv capacity challenge cs.cl gpt gpt3 gpt-4 language language models large language large language models llm memories paradigm retrieval text text generation tool type

Seeking Developers and Engineers for AI T-Shirt Generator Project

@ Chevon Hicks | Remote

Software Engineer for AI Training Data (School Specific)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Python)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Tier 2)

@ G2i Inc | Remote

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

Senior Associate, Data and Analytics

@ Publicis Groupe | New York City, United States