all AI news
S2WAT: Image Style Transfer via Hierarchical Vision Transformer using Strips Window Attention. (arXiv:2210.12381v2 [cs.CV] UPDATED)
cs.CV updates on arXiv.org arxiv.org
This paper presents a new hierarchical vision Transformer for image style
transfer, called Strips Window Attention Transformer (S2WAT), which serves as
an encoder of encoder-transfer-decoder architecture. With hierarchical
features, S2WAT can leverage proven techniques in other fields of computer
vision, such as feature pyramid networks (FPN) or U-Net, to image style
transfer in future works. However, the existing window-based Transformers will
cause a problem that the stylized images will be grid-like when introduced into
image style transfer directly. To solve …
arxiv attention hierarchical image style transfer transfer transformer vision