EleutherAI’s got a 6.1B model out now
…I guess I know what my next @nostalgebraist-autoresponder project is now, huh
(To be clear: I am exhausted from moving house right now, and the transition to 2.7B was time-consuming and frustrating [partially due to some dumb choices on my part]. If I do 6.1B at all, it will be a similarly big undertaking. Don’t expect anything soon)
—
EDIT: originally wrote 6.7B here. It’s actually 6.1B, but eval metrics are on part with GPT-3 6.7B
Update: I’m currently fine-tuning it on my tumblr corpus, we’ll see how it goes…





