comparsion of gpt models

a table of evaluation metrics for gpt models
intro i put together a table of various metrics for all the main gpt models. i think this is a useful resource for selecting appropriate models based on needs and computing costs.
more (200 words) →

micro-finetuning gpt2

notes on finetuning gpt2 with a tiny dataset
introduction disclaimer i am primarily a software engineer, and thus am quite an amateur at machine learning.
more (1900 words) →

keras on amd with plaidml

setup for amd gpu acceleration for keras
intro this week i wanted to revisit one of my old machine learning projects. that project used the excellent keras library for building the model.
more (300 words) →