Tuning GPT2 on 305K lines of dialog from popular films

Below is the Hugging Face space for my version of GPT-2 I quickly tuned on movie dialog. The original dataset is from Cristian Danescu-Niculescu-Mizil at the University of Cornell and can be found in a Hugging Face dataset here.

This was done on the free version of Google Colab and took ~90 mins and is hosted in a free space, and uses the version of GPT-2 with 137M parameters.

Updated: