In July I wrote a long, unfinished rant about language model scaling and evaluation on LW.
It sat in my drafts folder there for a long time, and it became clear to me that I was never going to go back and properly finish it.
So I went ahead and published it as-is, with a brief note at the start explaining the situation.
If you liked my earlier posts about GPT models – the ones that weren’t about Frank, I mean – this one may also interest you.
Some updates:
Janus (AKA moire), the author of generative.ink, left a long and interesting comment.
I replied with a long comment of my own, long enough it could be a post in itself.
—-
Surprisingly, to me anyway, this post has rapidly become my highest-karma post on LW, eclipsing even the Gary Marcus post.
So my worries from this thread with @the-moti were clearly unfounded. The new post takes a “boo AI!” tone, reiterates my original critiques of the GPT-3 paper, etc., but none of that caused it get a negative reception on LW.














