Super-resolution (SR) aims at recovering a high-resolution (HR) image or video clip from its corresponding low-resolution (LR) one. Despite that convolutional neural networks (CNN) have recently demonstrated high-quality reconstruction for super-resolution, realistic SR, i.e., recovering natural and realistic textures, remains a challenging problem. The reason is that SR is an intrinsically ill-posed problem. CNN-based methods mitigate this problem by restricting the solution space by priors learned from the external dataset. However, such general priors are not enough for realistic SR. This thesis attempts to address the ill-posed challenge from two aspects: incorporating semantic priors and pushing solutions to a more accurate natural ima...