Sorry for the brief Frank outage around 1 PM PST today (not sure anyone even noticed?).
Relatedly, TIL that the Python smart_open package is a dangerous way to upload files to GCS!
It calls a method in the GCS Python SDK in a manner that disables the SDK’s internal retry mechanism, while also not implementing retry logic of its own.
Instead, if the upload fails partway through, it just raises an exception and hands control back to you. And worse, it’s a so-called “resumable upload session,” which is supposed to let you resume the upload if it fails partway through … but smart_open doesn’t do that, and the exception it raises doesn’t contain the information you’d need to do that, even if you wanted to.
I lost over a week of logs due to this – not mission critical ones, but it’s still kind of a bummer. Oh well.
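
A rough sketch of one way to guard against this kind of failure: since the upload can't be resumed, wrap the whole thing in your own retry loop and redo it from scratch when it fails. The names here (upload_with_retry, upload_fn) are hypothetical and not part of smart_open or the GCS SDK. Catching every Exception is a simplifying assumption; in practice you'd probably narrow it to errors that look transient.

```python
import random
import time


def upload_with_retry(upload_fn, max_attempts=5, base_delay=1.0, sleep=time.sleep):
    """Call upload_fn() until it succeeds, restarting from scratch each time.

    upload_fn is a hypothetical zero-argument callable that performs the
    ENTIRE upload (open the destination, write all bytes, close it).
    """
    for attempt in range(1, max_attempts + 1):
        try:
            return upload_fn()
        except Exception:  # assumption: any failure is worth retrying
            if attempt == max_attempts:
                raise  # out of attempts: surface the last error to the caller
            # Exponential backoff with jitter before redoing the whole upload.
            delay = base_delay * 2 ** (attempt - 1)
            sleep(delay + random.uniform(0, delay / 2))
```

With smart_open, upload_fn would be whatever function opens the gs:// destination and writes the full file. The important part is keeping the source data around until the call returns, so a failed attempt can be repeated from the beginning.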


