July 6, 2023, 2:11 a.m. | /u/lostintoomanyfandoms

Computer Vision www.reddit.com

Hey! I am a beginner at this and I'd like to see how DCGAN Architecture would work for Text to Image. I have added a text embedding module within the Generator and Discriminator. While the batch output seems to be improving over epochs, the model doesn't work well on unseen text. It seems to generate the same image every time. I am using the CUBS dataset with approx 10 captions for each image. Can someone help me understand what is …

architecture beginner computervision dcgan embedding generator hey image text work

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

Lead Developer (AI)

@ Cere Network | San Francisco, US

Research Engineer

@ Allora Labs | Remote

Ecosystem Manager

@ Allora Labs | Remote

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US