March 27, 2024, 4:46 a.m. | Lanyun Zhu, Tianrun Chen, Deyi Ji, Jieping Ye, Jun Liu

cs.CV updates on arXiv.org

arXiv:2311.16926v4 Announce Type: replace
Abstract: This paper proposes LLaFS, the first attempt to leverage large language models (LLMs) in few-shot segmentation. In contrast to the conventional few-shot segmentation methods that only rely on the limited and biased information from the annotated support images, LLaFS leverages the vast prior knowledge gained by LLM as an effective supplement and directly uses the LLM to segment images in a few-shot manner. To enable the text-based LLM to handle image-related tasks, we carefully design …
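To make the idea concrete, below is a minimal, hypothetical sketch of what an LLM-driven few-shot segmentation loop could look like: support images and masks are serialized into a text prompt alongside the query image, the LLM is asked for the target region as coordinates, and those coordinates are rasterized back into a mask. All names here (encode_image, query_llm, coords_to_mask, llm_few_shot_segment) are illustrative stand-ins; the truncated abstract does not specify LLaFS's actual design, so this is not the paper's method.

```python
# Hypothetical sketch of an LLM-driven few-shot segmentation pipeline.
# None of these components are from the LLaFS paper; they illustrate the
# general pattern of prompting a text-based LLM with serialized images.
import numpy as np

def encode_image(image: np.ndarray, grid: int = 16) -> list[str]:
    """Stand-in visual tokenizer: describe each grid cell by mean intensity."""
    h, w = image.shape[:2]
    ch, cw = h // grid, w // grid
    tokens = []
    for i in range(grid):
        for j in range(grid):
            cell = image[i * ch:(i + 1) * ch, j * cw:(j + 1) * cw]
            tokens.append(f"cell({i},{j})={cell.mean():.0f}")
    return tokens

def query_llm(prompt: str) -> list[tuple[int, int]]:
    """Stub for an LLM call; a real system would parse coordinates out of
    the model's generated text. Returns a fixed polygon for illustration."""
    return [(4, 4), (4, 11), (11, 11), (11, 4)]

def coords_to_mask(coords: list[tuple[int, int]], grid: int = 16) -> np.ndarray:
    """Rasterize predicted vertices into a coarse grid mask
    (bounding-box fill here, for simplicity)."""
    mask = np.zeros((grid, grid), dtype=np.uint8)
    rows = [r for r, _ in coords]
    cols = [c for _, c in coords]
    mask[min(rows):max(rows) + 1, min(cols):max(cols) + 1] = 1
    return mask

def llm_few_shot_segment(support, support_masks, query, cls="object"):
    """Compose support examples and the query into one text prompt,
    then ask the (stub) LLM for the query mask as coordinates."""
    parts = [f"Task: segment the '{cls}' region."]
    for k, (img, m) in enumerate(zip(support, support_masks)):
        parts.append(f"Support {k}: {' '.join(encode_image(img)[:8])} ... "
                     f"mask: {int(m.sum())} positive cells")
    parts.append(f"Query: {' '.join(encode_image(query)[:8])} ...")
    return coords_to_mask(query_llm("\n".join(parts)))

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    support = [rng.integers(0, 255, (64, 64)) for _ in range(2)]
    support_masks = [np.zeros((16, 16), np.uint8) for _ in range(2)]
    query = rng.integers(0, 255, (64, 64))
    print(llm_few_shot_segment(support, support_masks, query).sum(),
          "grid cells predicted as foreground")
```

The key design point this sketch tries to capture is the one the abstract states: the LLM never sees pixels directly, so the images must be converted into a text-compatible representation before the model can reason about them in a few-shot prompt.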

