CLIPSwarm: Generating Drone Shows from Text Prompts with Vision-Language Models
March 21, 2024, 4:46 a.m. | Pablo Pueyo, Eduardo Montijano, Ana C. Murillo, Mac Schwager
cs.CV updates on arXiv.org (arxiv.org)
Abstract: This paper introduces CLIPSwarm, a new algorithm designed to automate the modeling of swarm drone formations based on natural language. The algorithm first enriches a provided word to compose a text prompt, which serves as input to an iterative search for the formation that best matches that word. It then refines formations of robots step by step to align with the textual description, using distinct "exploration" and "exploitation" steps. Our framework is currently …
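The iterative refinement described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify how formations are represented, scored, or updated, so everything below is an assumption. The names `refine_formation` and `toy_score` are hypothetical, robots are modeled as 2D points in the unit square, and `toy_score` is a placeholder standing in for the vision-language similarity between a rendered formation and the enriched text prompt.

```python
import random


def refine_formation(score, n_robots, iterations=200,
                     explore_prob=0.3, step=0.05, seed=0):
    """Hill-climb 2D robot positions in the unit square to maximize `score`.

    Hypothetical sketch: `score` is any callable mapping a formation
    (list of (x, y) tuples) to a float, higher meaning a better match.
    """
    rng = random.Random(seed)
    best = [(rng.random(), rng.random()) for _ in range(n_robots)]
    best_score = score(best)

    for _ in range(iterations):
        if rng.random() < explore_prob:
            # "Exploration" (assumed form): re-sample one robot anywhere.
            candidate = list(best)
            candidate[rng.randrange(n_robots)] = (rng.random(), rng.random())
        else:
            # "Exploitation" (assumed form): nudge every robot slightly,
            # clamping positions to the unit square.
            candidate = [
                (min(1.0, max(0.0, x + rng.uniform(-step, step))),
                 min(1.0, max(0.0, y + rng.uniform(-step, step))))
                for x, y in best
            ]

        candidate_score = score(candidate)
        # Keep the candidate only if it strictly improves the match.
        if candidate_score > best_score:
            best, best_score = candidate, candidate_score

    return best, best_score


def toy_score(formation):
    # Placeholder, NOT a vision-language model: rewards robots for
    # lying close to the horizontal line y = 0.5.
    return -sum((y - 0.5) ** 2 for _, y in formation)


if __name__ == "__main__":
    formation, final_score = refine_formation(toy_score, n_robots=5)
    print(final_score)
    for x, y in formation:
        print(round(x, 3), round(y, 3))
```

In a real pipeline the placeholder score would presumably be replaced by a similarity between an image of the formation and the text prompt; the accept-if-better loop is one simple choice among many and is used here only to make the exploration and exploitation steps concrete.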