Scaling Painting Style Transfer
B. Galerne
L. Raad
J. Lezama
J.-M. Morel

EGSR 2024 (CGF paper)


[Hal]
[ArXiv]
[GitHub]

Zoom into our ultra-high resolution result (8064x6048 pixels)

Abstract

Neural style transfer (NST) is a deep learning technique that produces an unprecedentedly rich style transfer from a style image to a content image. It is particularly impressive when it comes to transferring style from a painting to an image. NST was originally achieved by solving an optimization problem to match the global statistics of the style image while preserving the local geometric features of the content image. The two main drawbacks of this original approach is that it is computationally expensive and that the resolution of the output images is limited by high GPU memory requirements. Many solutions have been proposed to both accelerate NST and produce images with larger size. However, our investigation shows that these accelerated methods all compromise the quality of the produced images in the context of painting style transfer. Indeed, transferring the style of a painting is a complex task involving features at different scales, from the color palette and compositional style to the fine brushstrokes and texture of the canvas. This paper provides a solution to solve the original global optimization for ultra-high resolution (UHR) images, enabling multiscale NST at unprecedented image sizes. This is achieved by spatially localizing the computation of each forward and backward passes through the VGG network. Extensive qualitative and quantitative comparisons, as well as a perceptual study, show that our method produces style transfer of unmatched quality for such high-resolution painting styles. By a careful comparison, we show that state-of-the-art fast methods are still prone to artifacts, thus suggesting that fast painting style transfer remains an open problem.


UHR results visualizer

Style transfer results

(click on images to zoom in)

Style Content Result

(7914 x 6048)

(8064 x 6048)

(8064 x 6048)

(5319 x 4226)

(8064 x 6048)

(8064 x 6048)

Comparison with fast methods

(click on images to zoom in)

Style Content SPST (ours) CD URST

(3176 x 2606)

(4752 x 3168)

(4752 x 3168)

(4752 x 3168)

(4752 x 3168)

(3787 x 3024)

(4032 x 3024)

(4032 x 3024)

(4032 x 3024)

(4032 x 3024)

(4747 x 6000)

(5973 x 4480)

(5973 x 4480)

(5968 x 4480)

(5976 x 4482)

References:

Code




Algorithm overview: Our localized algorithm (right part) allows to compute the global style transfer loss and its gradient wrt x for images that are too large for the original algorithm of [Gatys et al. 2016] (left part).

 [GitHub]


Paper and supplementary material

B. Galerne, L. Raad, J. Lezama, J.-M. Morel.
Scaling Painting Style Transfer.
In EGSR 2024 (Computer Graphics Forum).
(HAL preprint)


[Bibtex]


Perceptual study

The perceptual study consisted of several evaluation instances, each of which compared four images: the style used for the transfer and the results of the three methods, which were displayed at random positions for each evaluation instance. Each participant was asked to select the result closest to the style of the style image among the three displayed results.

 [images_user_study.zip]



Full size UHR images


Most UHR images in the .pdf paper have been downscaled by a factor 4.
We provide below a link to the original results for better visualization.
 Dropbox link to full size UHR images

Acknowledgements

B. Galerne and L. Raad acknowledge the support of the project MISTIC (ANR-19-CE40-005).

This template was originally made by Phillip Isola and Richard Zhang for a colorful ECCV project; the code can be found here.