Scaling Laws for Neural Language Models

Get the empirical scaling laws for language model performance on the cross-entropy loss.

Video

References

Scaling Laws for Neural Language Models