6 - High Resolution Diffusive Model
Overview
Until recently, image synthesis tasks were mostly performed with deep generative models such as GANs, VAEs, and autoregressive models. However, these models struggle to synthesize high-quality samples on difficult, high-resolution datasets: GANs suffer from unstable training, while autoregressive models are generally slow to sample from. Diffusion models have therefore recently become important for high-resolution image and sound generation since, compared with the other generative methods (GANs, VAEs), their training is stable, which makes them very promising. Diffusion models work by corrupting the training data: Gaussian noise is added progressively, slowly erasing detail until the data becomes pure noise, and a neural network is trained to reverse this corruption. Running the learned reverse process synthesizes data from pure noise by gradually removing the noise until a clean sample is produced. A comparison of different high-resolution image generation models based on diffusion models is shown in figure 1.
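The progressive corruption described above has a convenient property: adding Gaussian noise step by step collapses into a single closed-form Gaussian perturbation, so a noisy sample at any step can be drawn directly. A minimal sketch of that forward process, assuming a linear noise schedule (the schedule values and array shapes here are illustrative, not taken from any particular paper):

```python
import numpy as np

def forward_noise(x0, t, betas):
    """Sample x_t ~ q(x_t | x_0) in closed form: t steps of progressive
    Gaussian noising collapse into one Gaussian perturbation of x_0."""
    alpha_bar = np.cumprod(1.0 - betas)[t]      # cumulative signal-retention factor
    noise = np.random.randn(*x0.shape)
    return np.sqrt(alpha_bar) * x0 + np.sqrt(1.0 - alpha_bar) * noise

# Toy example with a linear beta schedule (a common, illustrative choice).
betas = np.linspace(1e-4, 0.02, 1000)
x0 = np.linspace(-1.0, 1.0, 64).reshape(8, 8)   # stand-in for an image
xt = forward_noise(x0, 999, betas)              # at the last step, x_t is almost pure noise
print(xt.shape)  # (8, 8)
```

The trained network would then be asked to undo one such noising step at a time, starting from pure noise.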
Diffusion Models
Artificial Intelligence (AI) is gaining more and more ground in the world of imaging, from creating realistic photographs and generating deepfakes to colorization and upscaling. Lately, Google has employed its AI to convert pixelated photographs into high-resolution images: a machine learning model takes a low-resolution photo and upscales it, recovering as much detail as possible. There are several methods for upscaling a photo with Artificial Intelligence; the mechanisms employed by Google, called SR3 and CDM, are diffusion models. Below is a list of different diffusion models that generate high-resolution images.
SR3 Super-Resolution via Repeated Refinement, also known as SR3, is a method that takes a low-resolution image as input and builds a high-quality photograph from pure noise. During training, the model corrupts the target image by repeatedly adding noise until only noise remains, and then learns to reverse this process, conditioned on the low-resolution input.
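One way to picture how the low-resolution input guides the denoising: the network receives the upsampled low-resolution image alongside the current noisy high-resolution estimate. A sketch of that conditioning step, assuming an integer scale factor and using nearest-neighbour upsampling as a stand-in for the bicubic interpolation and U-Net denoiser used in practice:

```python
import numpy as np

def sr3_denoiser_input(lowres, noisy_highres):
    """Build the conditioned input for a denoising step: the low-res image,
    upsampled to the target size, stacked channel-wise with the current
    noisy high-res estimate (nearest-neighbour upsampling here is a
    simplification; SR3 itself uses bicubic interpolation)."""
    scale = noisy_highres.shape[0] // lowres.shape[0]   # assumes integer scale factor
    upsampled = np.kron(lowres, np.ones((scale, scale)))
    return np.stack([upsampled, noisy_highres], axis=0)  # 2-channel input

inp = sr3_denoiser_input(np.random.rand(16, 16), np.random.randn(64, 64))
print(inp.shape)  # (2, 64, 64)
```

Because the conditioning image is present at every denoising step, the reverse process converges toward a high-resolution image consistent with the low-resolution input rather than an arbitrary sample.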
CDM The CDM (a cascade of multiple diffusion models) is a class-conditional diffusion tool trained on ImageNet data to generate high-resolution natural images. The mechanism starts with a standard diffusion model at the lowest resolution, followed by a sequence of super-resolution models that add detail at progressively higher resolutions. This tool can be applied directly to improve photographs, for example to increase the resolution of images taken with a mobile camera.
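The cascade structure can be sketched as a simple pipeline: each stage upsamples the previous output and refines it. In this illustrative skeleton the refinement is a placeholder; in CDM each stage is a conditional diffusion model, and the scale factors here are made up for the example:

```python
import numpy as np

def upsample(img, scale):
    """Nearest-neighbour upsampling, standing in for a learned stage's resizer."""
    return np.kron(img, np.ones((scale, scale)))

def cascade(base_sample, stages):
    """Run a CDM-style cascade: each stage takes the previous output,
    upsamples it, and refines it. The refinement here is a placeholder
    (identity plus small noise) where CDM uses a conditional diffusion model."""
    x = base_sample
    for scale in stages:
        x = upsample(x, scale)
        x = x + 0.01 * np.random.randn(*x.shape)   # placeholder for diffusion refinement
    return x

out = cascade(np.random.rand(32, 32), stages=[2, 4])  # 32 -> 64 -> 256
print(out.shape)  # (256, 256)
```

Splitting generation across resolutions lets each model solve an easier problem than generating the full-resolution image in one shot.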
Big-GAN You can find more information about this model in the document referenced in [1].
Adaptive feature modification layers:
Paper: https://arxiv.org/pdf/1904.08118.pdf.
Code: https://github.com/hejingwenhejingwen/AdaFM
Tasks:
• Study diffusion methods (not including SR3) for generating high-resolution images.
• Implement an alternative method from the suggested ones (GANs, VAE, Big-GAN) and evaluate the results on different images from the ImageNet dataset, using the metrics indicated in the papers.
Supervisors
The project supervisors are:
References