Sauerworld Forum

Sauerbraten Talk => General Chat => Topic started by: Salatiel on May 06, 2022, 02:51:01 AM

Title: Using AI to manipulate Sauerbraten screenshots
Post by: Salatiel on May 06, 2022, 02:51:01 AM
In 2021 OpenAI published DALL-E, a neural network capable of manipulating and generating images from text prompts (https://openai.com/dall-e-2/ (https://openai.com/dall-e-2/)), it is still only available through a waitlist, but since then our dear open source people have come up with alternatives that we can use for free, and what do you do when you get your hands on one of these tools? you find a way to put Sauer in it of course :P
below I'll leave some experiments I did modifying Sauer screenshots using only text:

An even more futuristic Xenon base
(https://i.imgur.com/VnQuqFB.png)
(https://i.imgur.com/thwbGhe.png)(https://i.imgur.com/Mjlbpej.png)
There are other variables you can apply in addition to the text prompt, so some results may look weirder.
Here is a landed spaceship:
(https://i.imgur.com/yZWrcAH.png)(https://i.imgur.com/aOdWI5L.png)

Venezified venice
(https://i.imgur.com/o4axyDB.png)
if the canals became paved roads:
(https://i.imgur.com/Tdd1AjU.png)(https://i.imgur.com/GzhnoiO.png)
(https://i.imgur.com/tyR4X3L.png)(https://i.imgur.com/Q41RbQ5.png)

Turbined turbine
(https://i.imgur.com/6PEvfzD.png)
(https://i.imgur.com/0irBvK1.png)(https://i.imgur.com/Viy9fGr.png)
(https://i.imgur.com/zOrZvzy.png)(https://i.imgur.com/iaG0Ky7.png)

Snow and rain in urban_c
(https://i.imgur.com/fPODINR.png)
(https://i.imgur.com/HwSZsTQ.png)(https://i.imgur.com/m37zwKZ.png)
(https://i.imgur.com/xoYnWhC.png)(https://i.imgur.com/0h5gwbZ.png)

Modern architecture in urban_c
(https://i.imgur.com/sMqtuJ9.png)
(https://i.imgur.com/TOigAJF.png)(https://i.imgur.com/zbCs3wn.png)
(https://i.imgur.com/zoMY71T.png)(https://i.imgur.com/GnbjSz7.png)

Rainy alley in kopenhagen
(https://i.imgur.com/cdccelp.png)

Police car in ghetto
(https://i.imgur.com/uUH1OGg.png)

You can also use the result as input multiple times and get something that is more distinct from the original image, here's a screenshot of a race map made by TristamK shaped like a subway with people walking in it:
(https://i.imgur.com/fgeDf15.png)

Some more random tests:
Bedroom in CpHills (Cooper)
(https://i.imgur.com/OMqpudf.png)

Cooper's motorcycle
(https://i.imgur.com/9cONDcH.png)

Synderf's tank
(https://i.imgur.com/PvgLFcO.png)
(https://i.imgur.com/l8yw4TJ.png)(https://i.imgur.com/Q6Ep61X.png)

Galaxy's restaurant kitchen
(https://i.imgur.com/KDswStR.png)
(https://i.imgur.com/9dAR230.png)(https://i.imgur.com/JUJaVIM.png)

Razgriz's triforts
(https://i.imgur.com/I9ThU9D.png)
(https://i.imgur.com/0R2ABOM.png)
(https://i.imgur.com/N5eyVoM.png)

If you want to try it on your own without a super computer, there are some "Image Generation Models" available on Google Colab and Kaggle, some require you to have a pro account on these platforms, others like the one I'm using (Lite's Latent Diffusion v9 (https://www.kaggle.com/code/litevex/lite-s-latent-diffusion-v9-with-gradio/notebook)) just need a verified phone number.

Bonus mm-auggiecat.jpg
(https://i.imgur.com/9ksDVJY.png)