On-Policy Distillation as Active Inference in Finite Variational Models

Catalog Row171

Citation KeyFriedman2026PolicyDistillationAsActive171

Paper FolderAvailable

DOI10.5281/zenodo.20747834

Platform availability

✅ Zenodo
✅ GitHub
⬜ arXiv
⬜ OSF
⬜ HuggingFace
⬜ Software Heritage
⬜ PyPI
✅ Full documentation

Overview

Extracted from the local paper documentation when available.

This paper formulates on-policy distillation as active inference in finite variational models, with exact claims only for declared objects and interpretive claims explicitly bounded outside them. In the construction, the intractable teacher policy plays the role of the generative model $p(o,s)$, the tractable student policy is the approximate posterior $q(s)$, and the per-token reverse-KL...

on-policy distillationactive inferenceself-distillationprivileged informationfree energy principlereverse KL divergencepymdpsophisticated inference

Use Notes

Concise findings and methods pulled from README/SKILL documentation.

Findings / Concepts

This paper formulates on-policy distillation as active inference in finite variational models, with exact claims only for declared objects and interpretive claims explicitly bounded outside them.
In the construction, the intractable teacher policy plays the role of the generative model $p(o,s)$, the tractable student policy is the approximate posterior $q(s)$, and the per-token reverse-KL dist

Methods / Techniques

Free energy minimization
Bayesian modeling and inference

Citation

Plain-text citation for quick reuse.

Friedman, Daniel Ari. 2026. On-Policy Distillation as Active Inference in Finite Variational Models. Zenodo. DOI: 10.5281/zenodo.20747834. URL: https://doi.org/10.5281/zenodo.20747834.

Primary source Documentation Full Text Image Gallery Source repository BibTeX

Related in Active Inference

Other catalogued works in the same domain.

View all Active Inference works, software & media →