all AI news
[CVPR'24] LLM4SGG: Large Language Models for Weakly Supervised Scene Graph Generation
April 4, 2024, 3:33 a.m. | /u/kb_kim
machinelearningnews www.reddit.com
Incredibly, we achieve comparable performance to a fully supervised approach in terms of F@K, even when we only use **image captions** in Scene Graph Generation task.
For more details, refer to
paper: [https://arxiv.org/pdf/2310.10404.pdf](https://arxiv.org/pdf/2310.10404.pdf)
code: [https://github.com/rlqja1107/torch-LLM4SGG](https://github.com/rlqja1107/torch-LLM4SGG)
[Overall Framework](https://preview.redd.it/5fmqbz9dsdsc1.png?width=1065&format=png&auto=webp&s=6a72e722b589fccfad01e8152fd9c604a1587931)
[Performance Comparison](https://preview.redd.it/0vv7ll85tdsc1.png?width=1241&format=png&auto=webp&s=9b15139b629f5181f0c0e4623ee1fa3f0b8e1113)
captions cvpr graph image language language models large language large language models machinelearningnews performance terms work
More from www.reddit.com / machinelearningnews
Jobs in AI, ML, Big Data
Software Engineer for AI Training Data (School Specific)
@ G2i Inc | Remote
Software Engineer for AI Training Data (Python)
@ G2i Inc | Remote
Software Engineer for AI Training Data (Tier 2)
@ G2i Inc | Remote
Data Engineer
@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania
Artificial Intelligence – Bioinformatic Expert
@ University of Texas Medical Branch | Galveston, TX
Lead Developer (AI)
@ Cere Network | San Francisco, US