[P] ImageBind with SAM: A simple demo to generate masks with different modalities

May 16, 2023, 5:18 p.m. | /u/Technical-Vast1314

Machine Learning www.reddit.com

## ImageBind with SAM

We built a simple demo, [ImageBind-SAM](https://github.com/IDEA-Research/Grounded-Segment-Anything/tree/main/playground/ImageBind_SAM), which aims to segment images using inputs from different modalities.

The basic idea is as follows:

* Step 1: Generate automatic masks with `SamAutomaticMaskGenerator`
* Step 2: Crop every generated mask region from the image
* Step 3: Compute the similarity between each cropped image and the input from a different modality
* Step 4: Merge the mask regions with the highest similarity
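Steps 2 to 4 can be sketched roughly as below. This is a minimal NumPy illustration, not the demo's actual code: it assumes the masks arrive in the list-of-dicts format that `SamAutomaticMaskGenerator.generate` returns (a boolean `segmentation` array and an XYWH `bbox`). `toy_embed` is a hypothetical stand-in for the ImageBind encoders, and the "keep every region whose cosine similarity reaches a threshold" rule is an assumption about how the merge works.

```python
import numpy as np


def crop_regions(image, masks):
    """Step 2: crop each mask's bounding box (XYWH) out of the image."""
    crops = []
    for m in masks:
        x, y, w, h = (int(v) for v in m["bbox"])
        crops.append(image[y:y + h, x:x + w])
    return crops


def cosine_scores(region_embs, query_emb, eps=1e-8):
    """Step 3: cosine similarity between each region embedding and the query."""
    region_embs = np.asarray(region_embs, dtype=np.float64)
    query_emb = np.asarray(query_emb, dtype=np.float64)
    region_norms = np.linalg.norm(region_embs, axis=1, keepdims=True)
    regions = region_embs / (region_norms + eps)
    query = query_emb / (np.linalg.norm(query_emb) + eps)
    return regions @ query


def merge_masks(masks, scores, threshold):
    """Step 4 (assumed rule): union of all masks scoring at or above threshold."""
    merged = np.zeros(masks[0]["segmentation"].shape, dtype=bool)
    for m, s in zip(masks, scores):
        if s >= threshold:
            merged |= m["segmentation"].astype(bool)
    return merged


def toy_embed(crop):
    """Hypothetical encoder stand-in: the mean colour of the crop."""
    return crop.reshape(-1, crop.shape[-1]).mean(axis=0)


# Toy 4x4 RGB image: left half red, right half blue.
image = np.zeros((4, 4, 3), dtype=np.float64)
image[:, :2, 0] = 1.0
image[:, 2:, 2] = 1.0

left = np.zeros((4, 4), dtype=bool)
left[:, :2] = True
right = ~left
masks = [
    {"segmentation": left, "bbox": [0, 0, 2, 4]},
    {"segmentation": right, "bbox": [2, 0, 2, 4]},
]

crops = crop_regions(image, masks)
region_embs = np.stack([toy_embed(c) for c in crops])
query_emb = np.array([1.0, 0.0, 0.0])  # stands in for a "red" text or audio query
scores = cosine_scores(region_embs, query_emb)
merged = merge_masks(masks, scores, threshold=0.5)
```

In the real pipeline the region and query embeddings would come from ImageBind rather than `toy_embed`, so that an image crop and a text or audio input land in the same embedding space before the cosine comparison.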

The result looks like this:

https://preview.redd.it/e4ifzuk1980b1.png?width=1282&format=png&auto=webp&v=enabled&s=dfea6ddb1513007792819c944f3d688341a4d1e6

And the threshold for keeping the similar regions will …
