Promptless Task-Specific Finetuning of MetaAI Segment-Anything | allainews.com

Jan. 7, 2024, 10:49 a.m. | /u/randomnes-random

Computer Vision www.reddit.com

# Task:

**Finetune SAM model on Custom dataset to segment objects without prompts (during training and inference)**

# Approach:

https://preview.redd.it/8h0xgyk500bc1.png?width=1333&format=png&auto=webp&s=839212aaf8ab209ea0e4eadebaec9d3467c4df4c

>Note: The post is created using my Kaggle notebook -- [https://www.kaggle.com/code/yogendrayatnalkar/promptless-taskspecific-finetuning-of-metaai-sam](https://www.kaggle.com/code/yogendrayatnalkar/promptless-taskspecific-finetuning-of-metaai-sam)

## How does SAM work (high-level):

* Sam Encoder --> **ViT + Neck-Module** (Consisting of 2 Conv2D layers used for downsampling the channels of the ViT output)
* The Encoder ViT has a patch-size of **16x16**.
* Input: **1024x1024x3**
* With the above patch-size and input-image-size, the …

computervision dataset downsampling encoder finetuning inference objects prompts sam segment training vit work

More from www.reddit.com / Computer Vision

My New project . open cv real time face and emotion recognation. drop ur thought … 11 hours ago | www.reddit.com

computervision emotion face project +1

Developing Software vs Off the Shelf 20 hours ago | www.reddit.com

computervision industry manufacturing opencv +5

YOLOv8 TensorRT based on the references provided by Ultralytics 22 hours ago | www.reddit.com

case computervision jetson jetson orin +4

CNN vs. Vision Transformer: A Practitioner's Guide to Selecting the Right Model 1 day, 1 hour ago | www.reddit.com

architecture blog cnn computervision +12

Processing 80 camera streams on a single rack-mounted server - anyone worked on a similar … 1 day, 17 hours ago | www.reddit.com

application cameras computervision decoding +7

Predicting the real world coordinates (x,y,z) of a ball from 2d image taken from a … 1 day, 20 hours ago | www.reddit.com

2d image box center computervision +7

2024 review of OCR tools extracting text from handwritten forms and documents 1 day, 22 hours ago | www.reddit.com

case computervision documents example +10

Looking for Recent Visual Programming Tools for Computer Vision 2 days, 1 hour ago | www.reddit.com

advance coding computer computer vision +13

Multi box localization 2 days, 3 hours ago | www.reddit.com

box computervision experience extract +10

Software Engineer for AI Training Data (School Specific)

@ G2i Inc | Remote

View on ai-jobs.net

Software Engineer for AI Training Data (Python)

@ G2i Inc | Remote

View on ai-jobs.net

Software Engineer for AI Training Data (Tier 2)

@ G2i Inc | Remote

View on ai-jobs.net

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

View on ai-jobs.net

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

View on ai-jobs.net

Lead Developer (AI)

@ Cere Network | San Francisco, US

View on ai-jobs.net