[CVPR'24] LLM4SGG: Large Language Models for Weakly Supervised Scene Graph Generation | allainews.com

April 4, 2024, 3:33 a.m. | /u/kb_kim

machinelearningnews www.reddit.com

It is the first work to leverage a **Large Langage Model** on Scene Graph Generation task.
Incredibly, we achieve comparable performance to a fully supervised approach in terms of F@K, even when we only use **image captions** in Scene Graph Generation task.
For more details, refer to

paper: [https://arxiv.org/pdf/2310.10404.pdf](https://arxiv.org/pdf/2310.10404.pdf)

code: [https://github.com/rlqja1107/torch-LLM4SGG](https://github.com/rlqja1107/torch-LLM4SGG)

[Overall Framework](https://preview.redd.it/5fmqbz9dsdsc1.png?width=1065&format=png&auto=webp&s=6a72e722b589fccfad01e8152fd9c604a1587931)

[Performance Comparison](https://preview.redd.it/0vv7ll85tdsc1.png?width=1241&format=png&auto=webp&s=9b15139b629f5181f0c0e4623ee1fa3f0b8e1113)

captions cvpr graph image language language models large language large language models machinelearningnews performance terms work

More from www.reddit.com / machinelearningnews

Meet Verba 1.0: Run State-of-the-Art RAG Locally with Ollama Integration and Open Source Models 7 hours ago | www.reddit.com

art integration machinelearningnews ollama +3

Researchers from Columbia University and Databricks Conducted a Comparative Study of LoRA and Full Finetuning … 1 day, 5 hours ago | www.reddit.com

adjusting columbia columbia university comparative study +18

01.AI Introduces Yi-1.5-34B Model: An Upgraded Version of Yi with a High-Quality Corpus of 500B … 1 day, 22 hours ago | www.reddit.com

machinelearningnews

Meta AI Introduces Chameleon: A New Family of Early-Fusion Token-based Foundation Models that Set a … 2 days, 4 hours ago | www.reddit.com

architecture document enabling family +21

GeoDiffuser: A Zero shot optimization-based method to perform common 2D and 3D image editing tasks … 2 days, 7 hours ago | www.reddit.com

editing image inpainting machinelearningnews +8

Researchers from Cerebras & Neural Magic Introduce Sparse Llama: The First Production LLM based on … 2 days, 8 hours ago | www.reddit.com

austria cerebras cerebras systems create +18

FREE AI WEBINAR from our Partners: 'How to Build Local LLM Apps with Ollama & … 2 days, 10 hours ago | www.reddit.com

ai webinar apps build free +10

SpeechVerse: A Multimodal AI Framework that Enables LLMs to Follow Natural Language Instructions for Performing … 2 days, 11 hours ago | www.reddit.com

ai framework diverse framework language +9

Tired of MMLU? The current models already hit the ceiling? It's time to upgrade MMLU! … 3 days, 7 hours ago | www.reddit.com

benchmark benchmarking capabilities current +13

Software Engineer for AI Training Data (School Specific)

@ G2i Inc | Remote

View on ai-jobs.net

Software Engineer for AI Training Data (Python)

@ G2i Inc | Remote

View on ai-jobs.net

Software Engineer for AI Training Data (Tier 2)

@ G2i Inc | Remote

View on ai-jobs.net

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

View on ai-jobs.net

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

View on ai-jobs.net

Lead Developer (AI)

@ Cere Network | San Francisco, US

View on ai-jobs.net