Pinned Loading
-
pretraining-with-human-feedback
pretraining-with-human-feedback PublicCode accompanying the paper Pretraining Language Models with Human Preferences
-
active-inference
active-inference PublicA toy model of Friston's active inference in Tensorflow
-
bliss-attractors
bliss-attractors PublicA toy Inspect implementation of the Bliss Attractor eval from Claude 4 System Card Welfare Assessment
-
-
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.