Taming Transformers for High-Resolution Image Synthesis
In Taming Transformers for High-Resolution Image Synthesis , Esser et al. (2020) present “the first results on semantically-guided synthesis of megapixel images with transformers” — high-resolution AI-generated pictures! The samples on the project’s website are super impressive. Their model is “a convolutional VQGAN, which learns a codebook of context-rich visual parts, whose composition is modeled with an autoregressive transformer.”