We present Corgi, a novel method for text-to-image generation. Corgi is based on our proposed shifted diffusion model, which generates better image embeddings from input text. Unlike the baseline diffusion model used in DALL-E 2, our method encodes prior knowledge of the pre-trained CLIP model in its diffusion process through a new initialization distribution and a new transition step. Compared to the strong DALL-E 2 baseline, our method generates image embeddings from text more efficiently and more effectively, resulting in better text-to-image generation. Extensive large-scale experiments are conducted and evaluated in terms of both quantitative measures and human ev...
Recent text-to-image models have achieved impressive results. However, since they require large-scal...
Recently, there has been an increasing interest in developing diffusion-based text-to-image generati...
Score-based diffusion models have captured widespread attention and fueled fast progress of recent v...
Can a text-to-image diffusion model be used as a training objective for adapting a GAN generator to ...
Taking advantage of the many recent advances in deep learning, text-to-image generative models curre...
The image captioning task has been extensively researched by previous work. However, limited experiments...
Large-scale text-to-image generative models have been a revolutionary breakthrough in the evolution ...
Diffusion models have shown impressive results in text-to-image synthesis. Using massive datasets of...
The excellent generative capabilities of text-to-image diffusion models suggest they learn informati...
Existing text-to-image diffusion models struggle to synthesize realistic images given dense captions...
Despite the impressive results of arbitrary image-guided style transfer methods, text-driven image s...
Can continuous diffusion models bring the same performance breakthrough on natural language they did...
Text-to-image synthesis for the Chinese language poses unique challenges due to its large vocabulary...
Recent progress in diffusion models has revolutionized the popular technology of text-to-image gener...
Generative image synthesis with diffusion models has recently achieved excellent visual quality in s...