Web: https://www.reddit.com/r/computervision/comments/vfgn3d/harvard_researchers_introduce_a_novel_vit/

June 18, 2022, 10:11 p.m. | /u/No_Coffee_4638

Computer Vision reddit.com

🚦 HIPT (Hierarchical Image Pyramid Transformer) is pretrained across 33 cancer types using 10,678 gigapixel whole-slide images (WSIs), 408,218 images of 4096 × 4096 pixels, and 104M images of 256 × 256 pixels

🚦 HIPT pushes the boundaries of both Vision Transformers and self-supervised learning in two important ways.

🚦 The code is available on GitHub (linked below)
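
The image counts in the first bullet are consistent with each other if every 4096 × 4096 image is cut into a non-overlapping grid of 256 × 256 images. That tiling is an assumption made here for illustration; the post itself does not say how the smaller images were extracted. A quick arithmetic check in Python, with hypothetical variable names:

```python
# Sanity check of the image counts quoted in the post.
# Assumption (not stated in the post): each 4096 x 4096 image is split
# into a non-overlapping grid of 256 x 256 images.

REGION_SIDE = 4096     # side length of the larger images, in pixels
PATCH_SIDE = 256       # side length of the smaller images, in pixels
NUM_REGIONS = 408_218  # number of 4096 x 4096 images quoted in the post

patches_per_side = REGION_SIDE // PATCH_SIDE        # 16 tiles along each edge
patches_per_region = patches_per_side ** 2          # 256 tiles per larger image
total_patches = NUM_REGIONS * patches_per_region    # 104,503,808 tiles overall

print(f"{patches_per_region} small images per 4096 x 4096 image")
print(f"{total_patches:,} small images in total (~{total_patches / 1e6:.1f}M)")
```

Under that assumption the total comes to about 104.5M, which matches the "104M" figure in the post.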

[Continue reading](https://www.marktechpost.com/2022/06/18/harvard-researchers-introduce-a-novel-vit-architecture-called-hierarchical-image-pyramid-transformer-hipt-that-can-scale-vision-transformers-to-gigapixel-images-via-hierarchical-self-supervised-lear/) | *Check out the* [*paper*](https://arxiv.org/pdf/2206.02647.pdf) *and the* [*GitHub repository*](https://github.com/mahmoodlab/HIPT)

https://i.redd.it/c0ivcnxbeg691.gif
