Sept. 12, 2022, 3:30 p.m. | /u/cccntu

Machine Learning www.reddit.com

A few weeks ago, before Stable Diffusion was officially released, I found that fine-tuning DALL·E Mini's VQGAN decoder can improve reconstruction quality on anime images. See:

https://preview.redd.it/eekf9hjt3gn91.png?width=1280&format=png&auto=webp&s=25938a4ad284e6cfff958ad0d69968cd2c01ed18

And with only a few lines of code changed, I was able to fine-tune the Stable Diffusion VAE decoder as well. See:

https://preview.redd.it/45xogflo5gn91.png?width=1129&format=png&auto=webp&s=43f98e863b918bba9d7471a0cfa7de4dcc8df98c

You can find the exact training code used in this repo: [https://github.com/cccntu/fine-tune-models/](https://github.com/cccntu/fine-tune-models/)

More details about the models are also in the repo.
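The core idea — freeze everything except the VAE decoder and train it with a reconstruction loss — can be sketched in a few lines of PyTorch. This is a minimal illustration with a toy stand-in model, not the actual code from the repo; `TinyVAE` and its dimensions are hypothetical:

```python
# Minimal sketch of decoder-only fine-tuning: the encoder (and hence the
# latents) stays frozen, and only the decoder weights are updated with a
# simple reconstruction loss. A toy linear autoencoder stands in for the
# real Stable Diffusion / VQGAN autoencoder.
import torch
import torch.nn as nn

class TinyVAE(nn.Module):  # hypothetical stand-in model
    def __init__(self, dim=16, latent=4):
        super().__init__()
        self.encoder = nn.Linear(dim, latent)
        self.decoder = nn.Linear(latent, dim)

    def forward(self, x):
        return self.decoder(self.encoder(x))

model = TinyVAE()

# Freeze the encoder so only the decoder receives gradients.
for p in model.encoder.parameters():
    p.requires_grad_(False)

# Optimizer is given only the decoder's parameters.
opt = torch.optim.Adam(model.decoder.parameters(), lr=1e-3)
loss_fn = nn.MSELoss()

x = torch.randn(8, 16)  # stand-in batch of "images"
for _ in range(5):
    opt.zero_grad()
    loss = loss_fn(model(x), x)  # reconstruction loss against the input
    loss.backward()
    opt.step()
```

Because the encoder is untouched, the latent space stays compatible with the original diffusion model — only the mapping from latents back to pixels improves.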

And you can play with the former model at [https://github.com/cccntu/anim_e](https://github.com/cccntu/anim_e)

