You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
some changes to improve clarity of task description, and general updates
to improve task page
---------
Co-authored-by: Pedro Cuenca <[email protected]>
Co-authored-by: Merve Noyan <[email protected]>
Co-authored-by: Omar Sanseviero <[email protected]>
Copy file name to clipboardExpand all lines: packages/tasks/src/tasks/image-to-image/about.md
+70-21Lines changed: 70 additions & 21 deletions
Original file line number
Diff line number
Diff line change
@@ -1,15 +1,10 @@
1
-
## Use Cases
2
-
3
-
### Style transfer
1
+
Image-to-image pipelines can also be used in text-to-image tasks, to provide visual guidance to the text-guided generation process.
4
2
5
-
One of the most popular use cases of image-to-image is style transfer. Style transfer models can convert a normal photography into a painting in the style of a famous painter.
6
-
7
-
## Task Variants
3
+
## Use Cases
8
4
9
5
### Image inpainting
10
6
11
-
Image inpainting is widely used during photography editing to remove unwanted objects, such as poles, wires, or sensor
12
-
dust.
7
+
Image inpainting is widely used during photography editing to remove unwanted objects, such as poles, wires, or sensor dust.
13
8
14
9
### Image colorization
15
10
@@ -24,18 +19,27 @@ Super-resolution models increase the resolution of an image, allowing for higher
24
19
You can use pipelines for image-to-image in 🧨diffusers library to easily use image-to-image models. See an example for `StableDiffusionImg2ImgPipeline` below.
25
20
26
21
```python
27
-
fromPILimport Image
28
-
from diffusers import StableDiffusionImg2ImgPipeline
22
+
import torch
23
+
from diffusers import AutoPipelineForImage2Image
24
+
from diffusers.utils import make_image_grid, load_image
Controlling the outputs of diffusion models only with a text prompt is a challenging problem. ControlNet is a neural network model that provides image-based control to diffusion models. Control images can be edges or other landmarks extracted from a source image.
62
+
### Style Transfer
63
+
64
+
One of the most popular use cases of image-to-image is style transfer. With style transfer models:
59
65
60
-
Many ControlNet models were trained in our community event, JAX Diffusers sprint. You can see the full list of the ControlNet models available [here](https://huggingface.co/spaces/jax-diffusers-event/leaderboard).
66
+
- a regular photo can be transformed into a variety of artistic styles or genres, such as a watercolor painting, a comic book illustration and more.
67
+
- new images can be generated using a text prompt, in the style of a reference input image.
68
+
69
+
See 🧨diffusers example for style transfer with `AutoPipelineForText2Image` below.
Controlling the outputs of diffusion models only with a text prompt is a challenging problem. ControlNet is a neural network model that provides image-based control to diffusion models. Control images can be edges or other landmarks extracted from a source image.
Pix2Pix is a popular model used for image-to-image translation tasks. It is based on a conditional-GAN (generative adversarial network) where instead of a noise vector a 2D image is given as input. More information about Pix2Pix can be retrieved from this [link](https://phillipi.github.io/pix2pix/) where the associated paper and the GitHub repository can be found.
65
109
@@ -70,8 +114,13 @@ The images below show some examples extracted from the Pix2Pix paper. This model
70
114
## Useful Resources
71
115
72
116
-[Image-to-image guide with diffusers](https://huggingface.co/docs/diffusers/using-diffusers/img2img)
117
+
- Image inpainting: [inpainting with 🧨diffusers](https://huggingface.co/docs/diffusers/main/en/api/pipelines/stable_diffusion/inpaint), [demo](https://huggingface.co/spaces/diffusers/stable-diffusion-xl-inpainting)
- Super resolution: [image upscaling with 🧨diffusers](https://huggingface.co/docs/diffusers/main/en/api/pipelines/stable_diffusion/upscale#super-resolution), [demo](https://huggingface.co/spaces/radames/Enhance-This-HiDiffusion-SDXL)
120
+
-[Style transfer and layout control with diffusers 🧨](https://huggingface.co/docs/diffusers/main/en/using-diffusers/ip_adapter#style--layout-control)
73
121
-[Train your ControlNet with diffusers 🧨](https://huggingface.co/blog/train-your-controlnet)
74
122
-[Ultra fast ControlNet with 🧨 Diffusers](https://huggingface.co/blog/controlnet)
123
+
-[List of ControlNets trained in the community JAX Diffusers sprint](https://huggingface.co/spaces/jax-diffusers-event/leaderboard)
"Image-to-image is the task of transforming a source image to match the characteristics of a target image or a target image domain. Any image manipulation and enhancement is possible with image to image models.",
96
+
"Image-to-image is the task of transforming an input image through a variety of possible manipulations and enhancements, such as super-resolution, image inpainting, colorization, and more.",
0 commit comments