AUTOMATIC1111 Stable Diffusion Web UI
Original authorAUTOMATIC1111
DevelopersAUTOMATIC1111 and community
Initial releaseAugustย 22, 2022; 3 years agoย (2022-08-22)[1]
Written inPython
LicenseAGPL-3.0[2]
Repositorygithub.com/AUTOMATIC1111/stable-diffusion-webui

AUTOMATIC1111 Stable Diffusion Web UI (SD WebUI, A1111, or Automatic1111[3]) is an open source generative artificial intelligence program that allows users to generate images from a text prompt.[4] It uses Stable Diffusion as the base model for its image capabilities together with a large set of extensions and features to customize its output.[5]

History

edit

SD WebUI was released on GitHub on August 22, 2022, by AUTOMATIC1111,[1] 1 month after the initial release of Stable Diffusion.[6] At the time, Stable Diffusion could only be run via the command line.[5] SD WebUI quickly rose in popularity and has been described as "the most popular tool for running diffusion models locally."[4][7] SD WebUI is one of the most popular user interfaces for Stable Diffusion, together with ComfyUI.[8] In February 2024, a book was published by ja:Gijutsu Hyoronsha on using Stable Diffusion with SD WebUI in Japanese.[9][10] As of July 2024, the project had 136,000 stars on GitHub.[11]

Features

edit

SD WebUI uses Gradio for its user interface.[12][13][14] Each parameter in the Stable Diffusion program is exposed via a UI interface within SD WebUI. SD WebUI contains additional parameters not included in Stable Diffusion itself, such as support for Low-rank adaptations, ControlNet and custom variational autoencoders.[12][13][15] SD WebUI supports prompt weighting, image-to-image based generation, inpainting, outpainting and image scaling.[16] It supports over 20 samplers including DDIM, Euler, Euler a, DPM++ 2M Karras, and UniPC.[16][17] It is also used for its various optimizations over the base Stable Diffusion.[5]

Stable Diffusion WebUI Forge

edit

Stable Diffusion WebUI Forge (Forge) is a notable fork of SD WebUI started by Lvmin Zhang, who is also the creator of ControlNet and Fooocus.[18][19] The initial goal of Forge was to improve the performance and features of SD WebUI with the intention to upstream changes back to SD WebUI.[18][19] One of Forge's optimizations allowed users with low VRAM to generate images faster on some versions of Stable Diffusion.[18] It improved generation speed for users with 8GB and 6GB VRAM by 30-45% and 60-75%, respectively.[18][19] Forge also includes extra features such as support for more samplers than standard SD WebUI.[20] Some of Forge's optimizations were borrowed from ComfyUI, and others were developed by the Forge team.[19] In August 2024, Forge added support for the Flux diffusion model developed by Black Forest Labs, which is not yet supported by SD WebUI.[21]

See also

edit

References

edit
  1. ^ a b AUTOMATIC1111 (Aug 22, 2022). "Initial commit". github.{{cite web}}: CS1 maint: numeric names: authors list (link)
  2. ^ AUTOMATIC1111 (Jan 15, 2023). "add license file". github. Retrieved 11 July 2024.{{cite web}}: CS1 maint: numeric names: authors list (link)
  3. ^ Brade, Stephen; Wang, Bryan; Sousa, Mauricio; Oore, Sageev; Grossman, Tovi (29 October 2023). "Promptify: Text-to-Image Generation through Interactive Prompt Exploration with Large Language Models". Proceedings of the 36th Annual ACM Symposium on User Interface Software and Technology. Association for Computing Machinery. pp.ย 1โ€“14. arXiv:2304.09337. doi:10.1145/3586183.3606725. ISBNย 979-8-4007-0132-0.
  4. ^ a b Mann, Tobias (29 Jun 2024). "A friendly guide to local AI image gen with Stable Diffusion and Automatic1111". The Register.
  5. ^ a b c Lewis, Nick (16 September 2022). "How to Run Stable Diffusion Locally With a GUI on Windows". How-To Geek. Retrieved 11 July 2024.
  6. ^ "Announcing SDXL 1.0". Stability AI. July 26, 2023.
  7. ^ Zhu, Andrew (2024). Using Stable Diffusion with Python: Leverage Python to control and automate high-quality AI image generation using Stable Diffusion. Packt Publishing. ISBNย 978-1835084311. Stable Diffusion WebUI from AUTO MATIC1111: This might be the most popular web-based application currently that allows users to generate images and text using Stable Diffusion. It provides a GUI interface that makes it easy to experiment with different settings and parameters
  8. ^ Hu, Qihan; Xu, Zhenghui; Du, Peng; Zeng, Hao; Ma, Tongqing; Zhao, Youbing; Xie, Hao; Zhang, Peng; Liu, Shuting; Zang, Tongnian; Wang, Xuemei (2024). "CanFuUI: A Canvas-Centric Web User Interface for Iterative Image Generation with Diffusion Models and ControlNet". AI-generated Content. Communications in Computer and Information Science. Vol.ย 1946. Springer Nature Singapore. pp.ย 128โ€“138. doi:10.1007/978-981-99-7587-7_11. ISBNย 978-981-99-7586-0. Currently, the most popular user interfaces for Stable Diffusion are Stable Diffusion WebUI and ComfyUI.
  9. ^ ๅคงๅดŽ, ้ก•; ๆฐดๅฃ, ็‘›ไป‹ (23 March 2024). ใฏใ˜ใ‚ใฆใงใ‚‚ใ“ใ“ใพใงใงใใ‚‹ Stable Diffusion็”ปๅƒ็”Ÿๆˆ๏ผปๆœฌๆ ผ๏ผฝๆดป็”จใ‚ฌใ‚คใƒ‰ (in Japanese). ja:ๆŠ€่ก“่ฉ•่ซ–็คพ. ISBNย 978-4-297-14083-0.
  10. ^ ใ‚ใ‚ใ—ใ‚ใ„ใใ‚„ (12 June 2024). "็ฌฌ817ๅ›ž ๅ‚่€ƒๆ›ธใ‚’็‰‡ๆ‰‹ใซUbuntuใงใ‚‚Stable Diffusion WebUIใ‚’ๅ‹•ไฝœใ•ใ›ใ€็”ปๅƒใ‚’็”Ÿๆˆใ™ใ‚‹". gihyo.jp (in Japanese). ja:ๆŠ€่ก“่ฉ•่ซ–็คพ.
  11. ^ AUTOMATIC1111 (August 2022). "Stable Diffusion Web UI". github.{{cite web}}: CS1 maint: numeric names: authors list (link)
  12. ^ a b Wang, Chenghao; Chung, Jeanhun (30 June 2023). "Research on AI Painting Generation Technology Based on the [Stable Diffusion]". International Journal of Advanced Smart Convergence. 12 (2): 90โ€“95. doi:10.7236/IJASC.2023.12.2.90. Stable Diffusion Web UI is a browser interface based on the Gradio library,
  13. ^ a b Kim, Seonuk; Ko, Taeyoung; Kwon, Yousang; Lee, Kyungho (9 October 2023). "Designing interfaces for text-to-image prompt engineering using stable diffusion models: a human-AI interaction approach". IASDR Conference Series. doi:10.21606/iasdr.2023.448. ISBNย 978-1-912294-59-6.
  14. ^ Hook, Steve (10 January 2024). "Stable Diffusion WebUI - Run SDXL locally with the AUTOMATIC1111 GUI". PC Guide.
  15. ^ Pocock, Kevin (16 August 2023). "Stable Diffusion: How to Use VAE". PC Guide. Retrieved 11 July 2024.
  16. ^ a b Phoenix, James; Taylor, Mike (2024). "AUTOMATIC1111 Web User Interface". Prompt engineering for generative AI: future-proof inputs for reliable AI outputs at scale (Firstย ed.). Beijing Boston: O'Reilly. ISBNย 978-1098153434.
  17. ^ Zhang, Jing; Jiang, Yan (June 2023). "Style Transfer Technology of Batik Pattern Based on Deep Learning". Journal of Fiber Bioengineering and Informatics. 16 (1): 57โ€“67. doi:10.3993/jfbim02171.
  18. ^ a b c d ่ฅฟๅท ๅ’Œไน… (14 February 2024). "ใ€่ฅฟๅทๅ’Œไน…ใฎไธๅฎšๆœŸใ‚ณใƒฉใƒ ใ€‘ VRAMใŒๅฐ‘ใชใ„GPUใง็”ปๅƒ็”ŸๆˆAIใ‚’่ซฆใ‚ใฆใ„ใŸไบบใซใ€‚ใ€ŒStable Diffusion WebUI Forgeใ€็™ปๅ ด๏ผ". PC Watch (in Japanese).
  19. ^ a b c d ๆ–ฐๆธ…ๅฃซ (February 26, 2024). "็”ปๅƒ็”ŸๆˆAIใ€ๅฎ‰ใ„PCใงใ‚‚้ซ˜้€Ÿใซ ่กๆ’ƒใฎใ€ŒStable Diffusion WebUI Forgeใ€ (1/4)". ASCII.jp (in Japanese).
  20. ^ Horsey, Julian (14 February 2024). "Stable Diffusion WebUI Forge up to 75% faster than Automatic 1111 and ComfyUI". Geeky Gadgets.
  21. ^ ็”ฐๅฃๅ’Œ่ฃ• (August 18, 2024). "่ฉฑ้กŒใฎ็”ปๅƒ็”ŸๆˆAIใ€ŒFLUX.1ใ€ใ‚’Stable Diffusion็”จใฎใ€ŒWebUI Forgeใ€ใงๅ‹•ใ‹ใ™๏ผˆ้ซ˜้€ŸๅŒ–ใ‚‚่ฉฆใ—ใฆใฟใพใ—ใŸ๏ผ‰ (1/6)". ASCII.jp (in Japanese).

๐Ÿ“š Artikel Terkait di Wikipedia

Stable Diffusion

called StableStudio. In addition to Stability's interfaces, many third party open source interfaces exist, such as AUTOMATIC1111 Stable Diffusion Web UI, which

ComfyUI

stars on GitHub. ComfyUI is one of the most popular user interfaces for Stable Diffusion, along with Automatic1111. ComfyUI's main feature is that it

Flux (text-to-image model)

generative AI user interfaces such as ComfyUI, Recraft Studio and Stable Diffusion WebUI Forge (a fork of Automatic1111 WebUI). Related to Flux is a text-to-video

AI art

Stability AI has a Stable Diffusion web interface called DreamStudio, plugins for Krita, Photoshop, Blender, and GIMP, and the Automatic1111 web-based open source

Synthetic media

Inversion". arXiv:2208.01618 [cs.CV]. "Textual Inversion ยท AUTOMATIC1111/stable-diffusion-webui Wiki". GitHub. Archived from the original on February