May 16, 2023, 5:18 p.m. | /u/Technical-Vast1314

Machine Learning www.reddit.com

## ImageBind with SAM

We build a simple demo [ImageBind-SAM](https://github.com/IDEA-Research/Grounded-Segment-Anything/tree/main/playground/ImageBind_SAM) here which aims to segment with different modalities

The basic idea is as follows:

* Step 1: Generate auto masks with `SamAutomaticMaskGenerator`
* Step 2: Crop all the generated regions from the masks
* Step 3: Compute the similarity with cropped images with different modalities
* Step 4: Merge the highest similarity mask region

And the result is shown as:

https://preview.redd.it/e4ifzuk1980b1.png?width=1282&format=png&auto=webp&v=enabled&s=dfea6ddb1513007792819c944f3d688341a4d1e6

And the threshold for keeping the similar regions will …

compute demo generated imagebind images machinelearning masks merge sam

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Social Insights & Data Analyst (Freelance)

@ Media.Monks | Jakarta

Cloud Data Engineer

@ Arkatechture | Portland, ME, USA