Web: https://www.reddit.com/r/computervision/comments/uohadu/meta_ai_introduces_makeascene_a_deep_generative/

May 13, 2022, 2:13 a.m. | /u/No_Coffee_4638

Computer Vision reddit.com

In recent years, research on text-to-image generation has grown rapidly. Nevertheless, current methods still lack at least three essential characteristics. First, most models accept only text as input. This is a significant limitation: the user can control attributes such as style or color, but not structure or form. The second limitation concerns human perception: the final aim of these models …


