I am interested in reinforcement learning, unsupervised learning and self-supervised learning. My current research is focused on learning models for accurate video prediction and generation.
Performing image interpolation and semantic manipulation (e.g. hair color, smiling, gender) with a PixelCN on facial images using Fisher scores as an embedding space