

I’d say if there’s a weak part in your admittedly tongue-in-cheek theory it’s requiring Roko to have had a broader scope plan instead of a really catchy brainfart, not the part about making the basilisk thing out to be smarter/nobler than it is.
Reframing the infohazard aspect as an empathy filter definitely has legs in terms of building a narrative.
I basilisk’s wager was framed like that, that you can’t know if you are already living in the torture sim with the basilisk silently judging you, it would be way more compelling that the actual “you are ontologically identical with any software that simulates you at a high enough level even way after the fact because [preposterous transhumanist motivated reasoning]”.