Papers that may be helpful when training LLMs.
Created 9 months ago
We investigate the optimal model size and number of tokens for training a transformer language model...