A Neural Algorithm of Artistic Style
read the original abstract
In fine art, especially painting, humans have mastered the skill to create unique visual experiences through composing a complex interplay between the content and style of an image. Thus far the algorithmic basis of this process is unknown and there exists no artificial system with similar capabilities. However, in other key areas of visual perception such as object and face recognition near-human performance was recently demonstrated by a class of biologically inspired vision models called Deep Neural Networks. Here we introduce an artificial system based on a Deep Neural Network that creates artistic images of high perceptual quality. The system uses neural representations to separate and recombine content and style of arbitrary images, providing a neural algorithm for the creation of artistic images. Moreover, in light of the striking similarities between performance-optimised artificial neural networks and biological vision, our work offers a path forward to an algorithmic understanding of how humans create and perceive artistic imagery.
This paper has not been read by Pith yet.
Forward citations
Cited by 16 Pith papers
-
The Silent Brush: Evaluating Artistic Style Leakage in AI Art Generation
Art Arena evaluates how artistic styles from training data leak into AI-generated images without explicit prompts, revealing asymmetric blending due to differences in representational strength and interaction dynamics...
-
Corruptions of Supervised Learning Problems: Typology and Mitigations
The paper introduces a Markov kernel framework for exhaustively classifying corruptions in supervised learning and derives loss corrections for label, attribute, and joint cases by comparing clean and corrupted Bayes risks.
-
One Model to Translate Them All: Universal Any-to-Any Translation for Heterogeneous Collaborative Perception
UniTrans pretrains a bank of translator experts and learns combination coefficients from modality mappings in a scene-invariant latent space to enable zero-shot any-to-any feature translation for heterogeneous collabo...
-
WILD SAM: A Simulated-and-Real Data Augmentation for Autonomous Driving Perception under Challenging Weather
WILD SAM combines denoised pseudo-labels from real adverse-weather images with simulation-based training to improve object detection AP by up to 13% on the Four Seasons dataset for rain and snow.
-
Defining Robust Ultrasound Quality Metrics via an Ultrasound Foundation Model
Proposes TinyUSFM-uLPIPS and TinyUSFM-NRQ metrics that show better alignment with segmentation task performance and expert preference than PSNR or VGG-LPIPS in ultrasound imaging.
-
Defining Robust Ultrasound Quality Metrics via an Ultrasound Foundation Model
TinyUSFM-uLPIPS and TinyUSFM-NRQ provide task-linked, cross-organ, and clinically predictive quality assessment for ultrasound images that outperforms conventional metrics in calibration with segmentation performance ...
-
Gram-MMD: A Texture-Aware Metric for Image Realism Assessment
Gram-MMD is a texture-aware realism metric that computes MMD on upper-triangular Gram matrices from backbone activations, providing complementary information to semantic distributional metrics.
-
Implicit Neural Representation-Based Continuous Single Image Super-Resolution: An Empirical Benchmark
Systematic benchmark reveals recent complex INR methods for continuous image super-resolution offer only marginal gains, with performance tied to training setups, auxiliary losses improving textures, and scaling laws holding.
-
VBench-2.0: Advancing Video Generation Benchmark Suite for Intrinsic Faithfulness
VBench-2.0 is a benchmark suite that automatically evaluates video generative models on five dimensions of intrinsic faithfulness: Human Fidelity, Controllability, Creativity, Physics, and Commonsense using VLMs, LLMs...
-
Lost in the Tower of Babel: The Adverse Effects of Incidental Multilingualism in LLMs
Incidental multilingualism from uneven web training makes LLMs unequal, brittle, and opaque across languages.
-
Are Targeted Data Poisoning Attacks as Effective as We Think?
The paper introduces clean-model-based metrics that stratify test samples by vulnerability to targeted poisoning, enabling worst-case attack evaluation and vulnerability-aware defenses.
-
Disentangled Makeup Transfer with Generative Adversarial Network
DMT uses identity and makeup encoders in a GAN to enable controllable makeup transfer from references and sampling of new styles from a prior distribution.
-
Position: Universal Aesthetic Alignment Narrows Artistic Expression
Universal aesthetic alignment in image models biases outputs toward conventional beauty and penalizes anti-aesthetic prompts even when they match explicit user instructions.
-
Fast Universal Style Transfer for Artistic and Photorealistic Rendering
ArtNet and PhotoNet enable one-pass fast universal style transfer with fewer artifacts, better detail preservation, and 3-100x speedup over prior AE-based methods.
-
Facial Makeup Transfer Combining Illumination Transfer
A layered image-processing pipeline with illumination transfer enables real-time facial makeup application from a single reference image while handling dark makeup and air-bangs.
-
DeepTEGINN: Deep Learning Based Tools to Extract Graphs from Images of Neural Networks
DeepTEGINN is a deep learning toolbox combining image processing and graph theory to automate graph extraction from brain tissue images as an alternative to manual tracing.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.