A Machine Learning model which takes an input Natural Language and produces an image matching that description.