Info-icon.png We have moved to https://openmodeldb.info/. This new site has a tag and search system, which will make finding the right models for you much easier! If you have any questions, ask here: https://discord.gg/cpAUpDK

Difference between revisions of "Using ESRGAN, Links, And Other Information"

From Upscale Wiki
Jump to navigation Jump to search
(Add DirectML Tutorial Link, specified that it's WIP // Edit via Wikitext Extension for VSCode)
 
(22 intermediate revisions by 4 users not shown)
Line 15: Line 15:
  
 
=== GUI Tools ===
 
=== GUI Tools ===
 +
----
 +
==== Cupscale [https://upscale.wiki/wiki/Cupscale_Tutorial (Wiki)] [https://github.com/n00mkrad/cupscale (GitHub)] ====
 +
----
 +
Cupscale is a young ESRGAN GUI that's built with usability in mind.
  
==== [https://github.com/ptrsuder/IEU.Winforms ptrsuder's IEU (Image Enhancement Utility)] ====
+
===== Advantages =====
 +
 
 +
* Run ESRGAN on CPU, Nvidia GPU or even AMD GPUs using Vulkan
 +
* Interpolate models on-the-fly (no need to create extra model files)
 +
* Create multiple types of comparisons
 +
* Batch upscale entire folders
 +
* Upscale GIFs and even videos
 +
* Multiple seamless modis are available
 +
* Supports many image formats including JPEG, PNG, WEBP, DDS, and more
 +
 
 +
===== Disadvantages =====
 +
 
 +
* No cross platform support
  
IEU is the recommended option out there right now, it has been used by many people and is considered mature, but if you encounter bugs or have feature requests, feel free to report them on Discord, or on GitHub repository by creating an issue.
+
==== IEU - Image Enhancement Utility [https://github.com/ptrsuder/IEU.Winforms (GitHub)] ====
 +
----
 +
IEU has been used by many people and is considered mature, but if you encounter bugs or have feature requests, feel free to report them on Discord, or on GitHub repository by creating an issue.
 
The '''Download''' is under the [https://github.com/ptrsuder/IEU.Winforms/ IEU.Winforms] Releases tab.
 
The '''Download''' is under the [https://github.com/ptrsuder/IEU.Winforms/ IEU.Winforms] Releases tab.
 
Please take a bit of your time to read [https://github.com/ptrsuder/IEU.Winforms/wiki/Basic-usage IEU wiki] first.
 
Please take a bit of your time to read [https://github.com/ptrsuder/IEU.Winforms/wiki/Basic-usage IEU wiki] first.
  
===== Advantages: =====
+
===== Advantages =====
  
* Easy to use
 
 
* Process any amount of images with all models you select via check-boxes
 
* Process any amount of images with all models you select via check-boxes
 
* Control over the output format and naming scheme
 
* Control over the output format and naming scheme
* Works around VRAM limitations
 
* Has a lot of advanced features available without the need to write any scripts
 
* Can upscale seamless(tiled) textures (landscape textures for example)
 
 
* Can use a different model for the alpha channel
 
* Can use a different model for the alpha channel
 
* Can upscale each channel separately (Red, Green, Blue and Alpha)
 
* Can upscale each channel separately (Red, Green, Blue and Alpha)
 
*: This is useful for non diffuse textures, so normal maps (_n.dds typically), specular maps, ...
 
*: This is useful for non diffuse textures, so normal maps (_n.dds typically), specular maps, ...
 
* Can interpolate different models right in the GUI
 
* Can interpolate different models right in the GUI
* Uses panorama image stitching to merge images. That makes it work with images of any resolution without cropping.
 
  
 
===== Disadvantages =====
 
===== Disadvantages =====
  
 
* No cross platform support
 
* No cross platform support
* GUI only, no CLI support
 
* Uses panorama image stitching to merge images. That makes it slower and blends slightly worse. Most people will likely not notice the difference.
 
  
 
=== CLI Tools ===
 
=== CLI Tools ===
 
+
----
==== [https://github.com/deorder/texture-upscaler Deorder's scripts] ====
+
==== Joey's ESRGAN [https://upscale.wiki/wiki/Joey's_ESRGAN_Tutorial (Wiki)] [https://github.com/JoeyBallentine/ESRGAN (GitHub)] ====
 
+
----
 
===== Advantages =====
 
===== Advantages =====
  
 
* Cross-Platform
 
* Cross-Platform
 
* Works on headless systems
 
* Works on headless systems
* Script for splitting images
+
* Auto splits and merges images in memory
* Script for merging images (to work around VRAM limitations)
+
* Multiple seamless modis are available
* Barebones, you can use them for a lot of other Neural Networks not just BasicSR / ESRGAN
+
* It can chain models
* Barebones, so you can script whatever you want around them
+
* As with any CLI tool, you can easily script around it to add further features.
 
 
===== Disadvantages =====
 
 
 
* Outdated, they use ImageMagick 6
 
* Problems when using it for images with dimensions not dividable though the patch size (by default 256x256)
 
* '''Don’t use [https://en.wikipedia.org/wiki/Windows_Subsystem_for_Linux WSL (Windows Subsystem for Linux)]''' if you run ESRGAN on a '''NVIDIA''' GPU since WSL '''doesn’t support GPU acceleration''', which makes it unable to run ESRGAN in CUDA mode. Use git bash or something similar instead.
 
 
 
===== Requirements =====
 
 
 
* bash (We install that via the windows guide. MacOS and Linux tend to come with it out of the box, no need to do anything for those there)
 
* imagemagick
 
 
 
==== Blue Amulet's Esrgan test.py script (Guide way) ====
 
 
 
 
 
===== Advantages =====
 
 
 
* Nothing additional needs to be installed
 
* Fastest option out there
 
  
 
===== Disadvantages =====
 
===== Disadvantages =====
  
* Only works for some image formats (yes, jpg and png work)
+
* You have to enter the full model names if you don't wrap a script around it
* No splitting of images, so you can run out of VRAM easily as images get bigger
+
* CLIs can feel intimidating to those unused to them
* You are limited by the amount of channels a model supports. Nearly all models out there require images to have exactly 3 channels (RGB).
 
 
 
===== How to use it =====
 
  
# Put the pictures or textures you want to upscale into the <code>LR</code> folder
+
===== Using DirectML with Joey's fork =====
# If you want to use a model with a scale other than 4, you need to edit the test.py file. Just open it with your text editor and change the scale to the one from your model. The scale of each model is documented on our wiki. If you want to run an artifact removal model, like my jpg model for example change <code>scale=4</code> to <code>scale=1</code>
+
[https://upscale.wiki/wiki/Using_DirectML_with_ESRGAN Work in Progress Tutorial]
# Open a terminal window and navigate to the <code>esrgan</code> folder
 
#* For Windows Shift right click in your <code>esrgan</code> folder and select <code>Open PowerShell window here</code>. For some users it might say <code>Command Line</code> instead, if that is the case for you click on that and procced
 
#* For Linux / MacOS users the process is similar. File managers like Nautilus for example allow you to open a terminal in a folder. If that isn't an option for you can can also navigate using commands. <code>cd</code> allows you to navigate and pwd shows you the current folder. <code>cd ..</code> goes one directory down (for example from <code>/home/combi/code/git/ctp</code> to <code>/home/combi/code/git</code>). Use <code>cd /path/to/whatever</code> to navigate to absolute (full) paths or <code>cd some-folder-in-the-current-folder</code> to navigate to a folder in the current open folder. (<code>pwd</code> = print working directory; cd = change directory) If you want to find out more about a command you can just type <code>man the command-you-want-to-know-about</code> or use the internet
 
# Enter:
 
## For Nvidia GPUs
 
##: <code>python test.py models/${theModelYouWantToUse}</code>
 
## For other GPUs / integrated Graphic
 
##: <code>python test.py --cpu models/${theModelYouWantToUse}</code>
 
# Don't enter <code>${theModelYouWantToUse}</code> Instead replace that with the name of a model of course As an example, for the default model it would be: <code>python test.py models/RRDB_ESRGAN_x4.pth</code>
 
# That was it, the results will be in the <code>results</code> folder
 
  
 
== Tips when using ESRGAN ==
 
== Tips when using ESRGAN ==

Latest revision as of 01:51, 31 October 2021

Make sure that you have followed the installation guide carefully (Windows, Arch Linux, MacOS). We support BlueAmulets fork offically, not the current ESRGAN branch from xinntao, if you use that one, you will get errors.

Now that you have installed ESRGAN you can upscale images. There are a few different ways of using ESRGAN. Below I will document the official one. This should work for everyone, but there are a few different applications designed to make the life of the users easier as well as to prevent common pitfalls of using ESRGAN.

Things you need to know before you use ESRGAN

  • ESRGAN supports only RGB images, that means it will remove alpha / transparency channels if present and it won't work with grayscale images.
  • ESRGAN is limited by the amount of VRAM you have.

But there are ways around both. At the time of writing this there are some popular tools used by a lot of us to solve this:

Tools / Ways to use ESRGAN

GUI Tools


Cupscale (Wiki) (GitHub)


Cupscale is a young ESRGAN GUI that's built with usability in mind.

Advantages
  • Run ESRGAN on CPU, Nvidia GPU or even AMD GPUs using Vulkan
  • Interpolate models on-the-fly (no need to create extra model files)
  • Create multiple types of comparisons
  • Batch upscale entire folders
  • Upscale GIFs and even videos
  • Multiple seamless modis are available
  • Supports many image formats including JPEG, PNG, WEBP, DDS, and more
Disadvantages
  • No cross platform support

IEU - Image Enhancement Utility (GitHub)


IEU has been used by many people and is considered mature, but if you encounter bugs or have feature requests, feel free to report them on Discord, or on GitHub repository by creating an issue. The Download is under the IEU.Winforms Releases tab. Please take a bit of your time to read IEU wiki first.

Advantages
  • Process any amount of images with all models you select via check-boxes
  • Control over the output format and naming scheme
  • Can use a different model for the alpha channel
  • Can upscale each channel separately (Red, Green, Blue and Alpha)
    This is useful for non diffuse textures, so normal maps (_n.dds typically), specular maps, ...
  • Can interpolate different models right in the GUI
Disadvantages
  • No cross platform support

CLI Tools


Joey's ESRGAN (Wiki) (GitHub)


Advantages
  • Cross-Platform
  • Works on headless systems
  • Auto splits and merges images in memory
  • Multiple seamless modis are available
  • It can chain models
  • As with any CLI tool, you can easily script around it to add further features.
Disadvantages
  • You have to enter the full model names if you don't wrap a script around it
  • CLIs can feel intimidating to those unused to them
Using DirectML with Joey's fork

Work in Progress Tutorial

Tips when using ESRGAN

  1. When upscaling compressed textures use a 1x decompression model for the format first and or downscale them first by at least 50% with ,code>nearest neighbor or box filtering first, before upscaling them in ESRGAN
  2. ESRGAN runs much faster on Nvidia GPUs, you can compile pytorch yourself for AMD GPUs but at the moment that is quite difficult to do
  3. If you run out of VRAM, use Deorder’s scripts or IEU, which will split the texture in smaller parts first
  4. If you have textures in sub-folders, use Deorder’s scripts or IEU with "Preserve folder structure" mode selected
  5. If your textures contain an alpha channel (transparency), use Deorder’s scripts or IEU
  6. Try out different models. In our Model Database you will find a lot of different models, that we trained ourself
  7. If you are still not happy with the results despite having tried out different models, consider training your own and sharing it with us later

Additional Notices

Windows

  • When upscaling large images (depending on your GPU for example 1000x1000px images) on Windows, it's possible for the operating system to kill the ESRGAN process if it takes too long. This can be fixed using the Nvidia Nsight Monitor app that is installed alongside the CUDA toolkit. Here are Instructions for doing so