D³ETR: Decoder Distillation for Detection Transformer (arXiv:2211.09768v1 [cs.CV])
Nov. 18, 2022, 2:14 a.m. | Xiaokang Chen, Jiahui Chen, Yan Liu, Gang Zeng
cs.CV updates on arXiv.org
While various knowledge distillation (KD) methods for CNN-based detectors have proven effective at improving small student models, baselines and recipes for DETR-based detectors have yet to be established. In this paper, we focus on the transformer decoder of DETR-based detectors and explore KD methods for it. The outputs of the transformer decoder are unordered, so there is no direct correspondence between the teacher's predictions and the student's, which poses a challenge for knowledge distillation. To this end, …
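The challenge the abstract describes, distilling between two unordered sets of predictions, can be made concrete with a short sketch. One standard way to align such sets is bipartite (Hungarian) matching, which DETR itself uses to assign predictions to ground truth. The snippet below is a minimal illustration under that assumption only; the match_and_distill helper, the cost weights, and the loss choices are hypothetical, and this is not the paper's actual method, which the truncated abstract does not spell out.

```python
# Minimal sketch: align unordered teacher/student DETR decoder outputs with
# Hungarian matching, then distill on the matched pairs. Illustrative only;
# names and weights are hypothetical, not the paper's MixMatcher.
import torch
import torch.nn.functional as F
from scipy.optimize import linear_sum_assignment

def match_and_distill(t_logits, s_logits, t_boxes, s_boxes):
    """t_/s_logits: [num_queries, C]; t_/s_boxes: [num_queries, 4]."""
    # Pairwise cost between every student and teacher prediction:
    # L1 distance between class distributions plus L1 box distance.
    cls_cost = torch.cdist(s_logits.softmax(-1), t_logits.softmax(-1), p=1)
    box_cost = torch.cdist(s_boxes, t_boxes, p=1)
    cost = (cls_cost + 5.0 * box_cost).detach().cpu().numpy()

    # Hungarian algorithm: a one-to-one student<->teacher assignment.
    s_idx, t_idx = linear_sum_assignment(cost)
    s_idx, t_idx = torch.as_tensor(s_idx), torch.as_tensor(t_idx)

    # Distill on matched pairs: KL on class distributions, L1 on boxes.
    kl = F.kl_div(s_logits[s_idx].log_softmax(-1),
                  t_logits[t_idx].softmax(-1),
                  reduction="batchmean")
    l1 = F.l1_loss(s_boxes[s_idx], t_boxes[t_idx])
    return kl + 5.0 * l1
```

Once a correspondence is fixed, the teacher's soft class distributions and boxes supervise the matched student queries much as in CNN-based KD; the hard part, which this sketch glosses over, is how to construct the matching itself.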