This repo contains lots of datasets for various ML models.
All content is licensed under the GNU GPL v3 license.
Dataset | Archive | Description | Resolution/Char Count | Colab notebook |
---|---|---|---|---|
Capital English letters | Download | The entire English alphabet, in capital letters, in Segoe UI Bold. | 32x32 | - |
TechCrunch articles about startups | Download | Various TechCrunch articles about startups. | 48180 | GPT-2 |