December 21, 2020
1 min to read

Scaling Laws for Neural Language Models

Empirical scaling laws for language model performance, measured by cross-entropy loss.

References
Scaling Laws for Neural Language Models