A Neural Representation of Sketch Drawings

David Ha; Douglas Eck

arxiv: 1704.03477 · v4 · pith:WEFXIJBTnew · submitted 2017-04-11 · 💻 cs.NE · cs.LG· stat.ML

A Neural Representation of Sketch Drawings

David Ha , Douglas Eck This is my paper

classification 💻 cs.NE cs.LGstat.ML

keywords drawingssketchneuralableclassescoherentcommonconditional

0 comments

read the original abstract

We present sketch-rnn, a recurrent neural network (RNN) able to construct stroke-based drawings of common objects. The model is trained on thousands of crude human-drawn images representing hundreds of classes. We outline a framework for conditional and unconditional sketch generation, and describe new robust training methods for generating coherent sketch drawings in a vector format.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 6 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Designing streetscapes from street-view imagery using diffusion models
cs.CV 2026-05 conditional novelty 7.0

A multimodal diffusion model generates controllable alternative streetscapes from street-view imagery using visual metrics and text, shown on Chicago and Orlando data with gains in semantic consistency.
VAnim: Rendering-Aware Sparse State Modeling for Structure-Preserving Vector Animation
cs.CV 2026-05 unverdicted novelty 7.0

VAnim creates open-domain text-to-SVG animations via sparse state updates on a persistent DOM tree, identification-first planning, and rendering-aware RL with a new 134k-example benchmark.
Stroke of Surprise: Progressive Semantic Illusions in Vector Sketching
cs.CV 2026-02 unverdicted novelty 7.0

Stroke of Surprise is a framework that generates vector sketches undergoing semantic transformation from one concept to another by adding strokes, using dual-branch SDS and overlay loss for optimization.
PaintCopilot: Modeling Painting as Autonomous Artistic Continuation
cs.CV 2026-05 unverdicted novelty 6.0

PaintCopilot models painting as an open-ended autoregressive process that predicts coherent brushstrokes from partial canvas observations using a ViT target predictor, flow-matching stroke generator, and VAE region sampler.
When Drawing Is Not Enough: Exploring Spontaneous Speech with Sketch for Intent Alignment in Multimodal LLMs
cs.HC 2026-04 unverdicted novelty 6.0

Adding spontaneous speech transcripts to sketches significantly improves multimodal LLMs' ability to generate design images aligned with designers' intent across form, function, experience, and overall.
Sketch and Text Synergy: Fusing Structural Contours and Descriptive Attributes for Fine-Grained Image Retrieval
cs.CV 2026-04 unverdicted novelty 5.0

STBIR fuses sketches and text via curriculum robustness, category optimization, and staged alignment to outperform prior methods on a new fine-grained benchmark dataset.