API Documentation · CompressedBeliefMDPs

API Documentation

Contents

Index

Types/Functors

CompressedBeliefMDPs.CompressedBeliefMDPType
CompressedBeliefMDP{B, A}

The CompressedBeliefMDP struct is a generalization of the compressed belief-state MDP presented in Exponential Family PCA for Belief Compression in POMDPs.

Type Parameters

  • B: The type of compressed belief states.
  • A: The type of actions.

Fields

  • bmdp::GenerativeBeliefMDP: The generative belief-state MDP.
  • compressor::Compressor: The compressor used to compress belief states.
  • ϕ::Bijection: A bijection representing the mapping from uncompressed belief states to compressed belief states. See notes.

Constructors

CompressedBeliefMDP(pomdp::POMDP, updater::Updater, compressor::Compressor)
 CompressedBeliefMDP(pomdp::POMDP, sampler::Sampler, updater::Updater, compressor::Compressor)

Constructs a CompressedBeliefMDP using the specified POMDP, updater, and compressor.

Warning

The 4-argument constructor is a quality-of-life constructor that calls fit! on the given compressor.

Example Usage

pomdp = TigerPOMDP()
 updater = DiscreteUpdater(pomdp)
 compressor = PCACompressor(1)
mdp = CompressedBeliefMDP(pomdp, updater, compressor)

For continuous POMDPs, see ParticleFilters.jl.

Notes

  • While compressions aren't usually injective, we cache beliefs and their compressions on a first-come, first-served basis, so we can effectively use a bijection without loss of generality.
source
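The first-come, first-served caching described in the notes can be illustrated with a standalone sketch (plain Julia, not the package's internals; `compress` here is a toy stand-in for any non-injective compressor):

```julia
# Toy, non-injective "compressor": nearby beliefs map to the same point.
compress(b) = round.(b; digits=1)

ϕ = Dict{Vector{Float64}, Vector{Float64}}()  # belief → compression
claimed = Set{Vector{Float64}}()              # compressions already claimed

function cache!(b)
    b̃ = compress(b)
    if !(b̃ in claimed)      # the first belief to produce b̃ claims it,
        push!(claimed, b̃)   # so ϕ stays one-to-one on its entries
        ϕ[b] = b̃
    end
    return b̃
end

cache!([0.15, 0.85])    # cached: first belief mapping to this compression
cache!([0.149, 0.851])  # same compression as above, so not cached again
```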
CompressedBeliefMDPs.CompressedBeliefPolicyType
CompressedBeliefPolicy

Maps a base policy for the compressed belief-state MDP to a policy for the true POMDP.

Fields

  • m::CompressedBeliefMDP: The compressed belief-state MDP.
  • base_policy::Policy: The base policy used for decision-making in the compressed belief-state MDP.

Constructors

CompressedBeliefPolicy(m::CompressedBeliefMDP, base_policy::Policy)

Constructs a CompressedBeliefPolicy using the specified compressed belief-state MDP and base policy.

Example Usage

policy = solve(solver, pomdp)
 s = initialstate(pomdp)
 a = action(policy, s) # returns the approximately optimal action for state s
v = value(policy, s)  # returns the approximately optimal value for state s
source
CompressedBeliefMDPs.CompressedBeliefSolverType
CompressedBeliefSolver

The CompressedBeliefSolver struct represents a solver for compressed belief-state MDPs. It combines a compressed belief-state MDP with a base solver to approximate the value function.

Fields

  • m::CompressedBeliefMDP: The compressed belief-state MDP.
  • base_solver::Solver: The base solver used to solve the compressed belief-state MDP.

Constructors

CompressedBeliefSolver(pomdp::POMDP, base_solver::Solver; updater::Updater=DiscreteUpdater(pomdp), sampler::Sampler=BeliefExpansionSampler(pomdp), compressor::Compressor=PCACompressor(1))
 CompressedBeliefSolver(pomdp::POMDP; updater::Updater=DiscreteUpdater(pomdp), sampler::Sampler=BeliefExpansionSampler(pomdp), compressor::Compressor=PCACompressor(1), interp::Union{Nothing, LocalFunctionApproximator}=nothing, k::Int=1, verbose::Bool=false, max_iterations::Int=1000, n_generative_samples::Int=10, belres::Float64=1e-3)

Constructs a CompressedBeliefSolver using the specified POMDP, base solver, updater, sampler, and compressor. Alternatively, you can omit the base solver, in which case a LocalApproximationValueIterationSolver (https://github.com/JuliaPOMDP/LocalApproximationValueIteration.jl) will be created instead. A different base solver is needed, for example, if the POMDP state and action spaces are continuous.

Example Usage

julia> pomdp = TigerPOMDP();
julia> solver = CompressedBeliefSolver(pomdp; verbose=true, max_iterations=10);
julia> solve(solver, pomdp);
[Iteration 1   ] residual:       8.51 | iteration runtime:    635.870 ms, (     0.636 s total)
[Iteration 2   ] residual:       3.63 | iteration runtime:      0.504 ms, (     0.636 s total)
[Iteration 3   ] residual:       10.1 | iteration runtime:      0.445 ms, (     0.637 s total)
[Iteration 4   ] residual:       15.2 | iteration runtime:      0.494 ms, (     0.637 s total)
[Iteration 5   ] residual:       6.72 | iteration runtime:      0.432 ms, (     0.638 s total)
[Iteration 6   ] residual:       7.38 | iteration runtime:      0.508 ms, (     0.638 s total)
[Iteration 7   ] residual:       6.03 | iteration runtime:      0.495 ms, (     0.639 s total)
[Iteration 8   ] residual:       5.73 | iteration runtime:      0.585 ms, (     0.639 s total)
[Iteration 9   ] residual:       4.02 | iteration runtime:      0.463 ms, (      0.64 s total)
[Iteration 10  ] residual:       7.28 | iteration runtime:      0.576 ms, (      0.64 s total)
source

Functions

CompressedBeliefMDPs.make_cacheFunction
make_cache(B, B̃)

Helper function that creates a cache that maps each unique belief from the set B to its corresponding compressed representation in B̃.

Arguments

  • B::Vector{<:Any}: A vector of beliefs.
  • B̃::Matrix{Float64}: A matrix where each row corresponds to the compressed representation of the beliefs in B.

Returns

  • Dict{<:Any, Vector{Float64}}: A dictionary mapping each unique belief in B to its corresponding compressed representation in B̃.

Example Usage

B = [belief1, belief2, belief3]
 B̃ = [compressed_belief1; compressed_belief2; compressed_belief3]
ϕ = make_cache(B, B̃)
source
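Conceptually, make_cache pairs each belief with the matching row of B̃. A standalone sketch with toy data (illustrative only, not the package's implementation):

```julia
B = [[1.0, 0.0], [0.5, 0.5], [0.0, 1.0]]  # toy beliefs
B̃ = [0.9 0.1; 0.5 0.5; 0.1 0.9]           # one compressed row per belief

# Equivalent in spirit to make_cache(B, B̃): map each unique belief
# to its compressed representation.
ϕ = Dict(b => B̃[i, :] for (i, b) in enumerate(B))

ϕ[[0.5, 0.5]]  # → [0.5, 0.5]
```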
CompressedBeliefMDPs.make_numericalFunction
make_numerical(B, pomdp)

Helper function that converts a set of beliefs B into a numerical matrix representation suitable for processing by numerical algorithms/compressors.

Arguments

  • B::Vector{<:Any}: A vector of beliefs.
  • pomdp::POMDP: The POMDP model associated with the beliefs.

Returns

  • Matrix{Float64}: A matrix where each row corresponds to a numerical representation of a belief in B.

Example Usage

B = [belief1, belief2, belief3]
B_numerical = make_numerical(B, pomdp)
source
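In spirit, make_numerical stacks each belief's probability vector as a row of a Float64 matrix. A standalone sketch using toy pmf vectors in place of DiscreteBeliefs (illustrative only):

```julia
beliefs = [[1.0, 0.0], [0.25, 0.75]]  # stand-ins for belief pmfs

# Row-stack the pmfs, mirroring make_numerical's output shape:
# each row of the resulting 2×2 Matrix{Float64} is one belief.
B_numerical = reduce(vcat, permutedims.(beliefs))
```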
CompressedBeliefMDPs.compress_POMDPFunction
compress_POMDP(pomdp, sampler, updater, compressor)

Creates a compressed belief-state MDP by sampling, compressing, and caching beliefs from the given POMDP.

Arguments

  • pomdp::POMDP: The POMDP model to be compressed.
  • sampler::Sampler: A sampler to generate a set of beliefs from the POMDP.
  • updater::Updater: An updater to initialize beliefs from states.
  • compressor::Compressor: A compressor to reduce the dimensionality of the beliefs.

Returns

  • CompressedBeliefMDP: The constructed compressed belief-state MDP.
  • Matrix{Float64}: A matrix where each row corresponds to the compressed representation of the sampled beliefs.

Example Usage

pomdp = TigerPOMDP()
sampler = BeliefExpansionSampler(pomdp)
updater = DiscreteUpdater(pomdp)
compressor = PCACompressor(2)
m, B̃ = compress_POMDP(pomdp, sampler, updater, compressor)

source
Environments · CompressedBeliefMDPs

Circular Maze

Description

This environment is a generalization of the Circular Maze POMDP described in Finding Approximate POMDP solutions Through Belief Compression.[1] The world consists of n_corridors 1D circular corridors that each have corridor_length states. The robot spawns in a random corridor. It must determine which corridor it's in, navigate to the proper goal state, and finally declare that it has finished.

Figure from Finding Approximate POMDP solutions Through Belief Compression.

Action Space

Transitions left and right are noisy and non-deterministic. Transition probabilities are from a discrete von Mises distribution with unit concentration and mean at the target state.

Num  Action                Description
1    CMAZE_LEFT            Move left with von Mises noise.
2    CMAZE_RIGHT           Move right with von Mises noise.
3    CMAZE_SENSE_CORRIDOR  Observe the current corridor.
4    CMAZE_DECLARE_GOAL    Ends the episode. Receive r_findgoal if at the goal.
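The discrete von Mises transition noise can be sketched directly; this is an illustrative discretization with unit concentration, not CircularMaze's internal code:

```julia
# Discretized von Mises pmf over a circular corridor of length n,
# with concentration κ and mode at the target state μ.
function discrete_vonmises(n::Integer, μ::Integer; κ::Real=1.0)
    θ(x) = 2π * (x - μ) / n                  # angular distance to the mode
    w = [exp(κ * cos(θ(x))) for x in 1:n]    # unnormalized von Mises mass
    return w ./ sum(w)                       # normalize to a pmf
end

p = discrete_vonmises(25, 3)
# sum(p) ≈ 1; mass is highest at state 3 and wraps around the corridor
```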

State Space

The (ordered) state space is an array of all CircularMazeStates and a terminalstate: [CircularMazeState(1, 1), ..., CircularMazeState(n_corridors, corridor_length), TerminalState()].

Observation Space

The observation space is the union of the state space and 1:n_corridors. If the robot picks CMAZE_SENSE_CORRIDOR, they observe the index of the current corridor. Otherwise, they observe their current state with von Mises noise.

Rewards

The goal is to navigate to the correct goal state for the given corridor and then to declare the goal once arrived. If the robot correctly declares the goal, it receives r_findgoal. It incurs r_timestep_penalty for every timestep it has not reached the goal. By default, r_findgoal is 1 and r_timestep_penalty is 0.

Starting State

The initial state is sampled from a repeated, discrete von Mises distribution, each centered at the middle of its corridor.

Episode End

The episode terminates once the robot declares the goal (CMAZE_DECLARE_GOAL), regardless of whether the robot is correct.

Documentation

CompressedBeliefMDPs.CircularMazeType
CircularMaze(n_corridors::Integer, corridor_length::Integer, discount::Float64, r_findgoal::Float64, r_timestep_penalty::Float64)
 CircularMaze(n_corridors::Integer, corridor_length::Integer; kwargs...)
 CircularMaze()

A POMDP representing a circular maze environment.

Fields

  • n_corridors::Integer: Number of corridors in the circular maze.
  • corridor_length::Integer: Length of each corridor.
  • probabilities::AbstractArray: Probability masses for creating von Mises distributions.
  • center::Integer: The central position in the maze.
  • discount::Float64: Discount factor for future rewards.
  • r_findgoal::Float64: Reward for finding the goal.
  • r_timestep_penalty::Float64: Penalty for each timestep taken.
  • states::AbstractArray: Array of all possible states in the maze.
  • goals::AbstractArray: Array of goal states in the maze.

Example

using CompressedBeliefMDPs
 
 n_corridors = 8
 corridor_length = 25
maze = CircularMaze(n_corridors, corridor_length)
source
CompressedBeliefMDPs.CircularMazeStateType
CircularMazeState(corridor::Integer, x::Integer)

The CircularMazeState struct represents the state of an agent in a circular maze.

Fields

  • corridor::Integer: The corridor number. The value ranges from 1 to n_corridors.
  • x::Integer: The position of the state within the corridor. The value ranges from 1 to the corridor_length.
source
  • [1] Roy doesn't actually name his toy environment. For the original environment details, see the "PCA Performance" subsection on page 8.
Compressors

Defining a Belief Compressor

In this section, we outline the requirements and guidelines for defining a belief Compressor.

Interface

The Compressor interface is extremely minimal. It only supports two methods: fit! and the associated functor. For example, if you wanted to implement your own Compressor, you could write something like this:

struct MyCompressor <: Compressor
    foo
    bar
end

# functor definition
function (c::MyCompressor)(beliefs)
    # YOUR CODE HERE
    return compressed_beliefs
end

function fit!(c::MyCompressor, beliefs)
    # YOUR CODE HERE
end

Implementation Tips

  • For robustness, both the functor and fit! should be able to handle AbstractVector and AbstractMatrix inputs.
  • fit! is called only once after beliefs are sampled from the POMDP.
  • CompressedBeliefSolver will attempt to convert each belief state (often of type DiscreteBelief) into an AbstractArray{Float64} using convert_s. As a convenience, CompressedBeliefMDP implements conversions for commonly used belief types; however, if the POMDP has a custom belief state, then it is the user's responsibility to implement the appropriate conversion.

Implemented Compressors

CompressedBeliefMDPs currently provides wrappers for the following compression types:

Principal Component Analysis (PCA)

CompressedBeliefMDPs.PCACompressorFunction

Wrapper for MultivariateStats.PCA.

source

Kernel PCA

CompressedBeliefMDPs.KernelPCACompressorFunction

Wrapper for MultivariateStats.KernelPCA.

source

Probabilistic PCA

CompressedBeliefMDPs.PPCACompressorFunction

Wrapper for MultivariateStats.PPCA.

source

Factor Analysis

CompressedBeliefMDPs.FactorAnalysisCompressorFunction

Wrapper for MultivariateStats.FactorAnalysis.

source

Isomap

CompressedBeliefMDPs.IsomapCompressorFunction

Wrapper for ManifoldLearning.Isomap.

source

Autoencoder

CompressedBeliefMDPs.AutoencoderCompressorType

Implements an autoencoder in Flux.

source

Variational Auto-Encoder (VAE)

CompressedBeliefMDPs.VAECompressorType

Implements a VAE in Flux.

source
Warning

Some compression algorithms aren't optimized for large belief spaces. While they pass our unit tests, they may fail on large POMDPs or without seeding. For large POMDPs, users may want a custom Compressor.

)
policy = solve(solver, pomdp)
rs = RolloutSimulator(max_steps=50)
r = simulate(rs, pomdp, policy)

Concepts and Architecture

CompressedBeliefMDPs.jl aims to implement a generalization of the belief compression algorithm for solving large POMDPs. The algorithm has four steps:

  1. collect belief samples,
  2. compress the samples,
  3. create the compressed belief-state MDP,
  4. solve the MDP.

Each step is handled by Sampler, Compressor, CompressedBeliefMDP, and CompressedBeliefSolver respectively.

For more details, please see the rest of the documentation or the associated paper.
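The four steps map onto the package's types directly. A minimal end-to-end sketch (assuming POMDPModels' TigerPOMDP and the default components used elsewhere in these docs):

```julia
using POMDPs, POMDPModels, POMDPTools, CompressedBeliefMDPs

pomdp = TigerPOMDP()
sampler = BeliefExpansionSampler(pomdp)    # 1. collect belief samples
compressor = PCACompressor(1)              # 2. compress the samples
updater = DiscreteUpdater(pomdp)
m, B̃ = compress_POMDP(pomdp, sampler, updater, compressor)  # 3. compressed belief-state MDP

solver = CompressedBeliefSolver(pomdp; sampler=sampler, updater=updater, compressor=compressor)
policy = solve(solver, pomdp)              # 4. solve the MDP
```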

DiscreteBelief{TigerPOMDP, Bool}(TigerPOMDP(-1.0, -100.0, 10.0, 0.85, 0.95), Bool[0, 1], [0.15000000000000002, 0.85])
DiscreteBelief{TigerPOMDP, Bool}(TigerPOMDP(-1.0, -100.0, 10.0, 0.85, 0.95), Bool[0, 1], [0.5, 0.5])
DiscreteBelief{TigerPOMDP, Bool}(TigerPOMDP(-1.0, -100.0, 10.0, 0.85, 0.95), Bool[0, 1], [0.85, 0.15000000000000002])
DiscreteBelief{TigerPOMDP, Bool}(TigerPOMDP(-1.0, -100.0, 10.0, 0.85, 0.95), Bool[0, 1], [0.9697986577181208, 0.030201342281879207])
source

Policy Sampler

CompressedBeliefMDPs.PolicySamplerType
PolicySampler

Samples belief states by rolling out a Policy.

Fields

  • policy::Policy: The policy used for decision making.
  • updater::Updater: The updater used for updating beliefs.
  • n::Integer: The maximum number of simulated steps.
  • rng::AbstractRNG: The random number generator used for sampling.
  • verbose::Bool: Whether to use a progress bar while sampling.

Constructors

PolicySampler(pomdp::POMDP; policy::Policy=RandomPolicy(pomdp), 
 updater::Updater=DiscreteUpdater(pomdp), n::Integer=10, 
 rng::AbstractRNG=Random.GLOBAL_RNG)

Methods

(s::PolicySampler)(pomdp::POMDP)

Returns a vector of unique belief states.

Example

julia> pomdp = TigerPOMDP();
julia> sampler = PolicySampler(pomdp; n=3);
julia> sampler(pomdp)
2-element Vector{Any}:
 DiscreteBelief{TigerPOMDP, Bool}(TigerPOMDP(-1.0, -100.0, 10.0, 0.85, 0.95), Bool[0, 1], [0.5, 0.5])
 DiscreteBelief{TigerPOMDP, Bool}(TigerPOMDP(-1.0, -100.0, 10.0, 0.85, 0.95), Bool[0, 1], [0.15000000000000002, 0.85])
source

ExplorationPolicy Sampler

CompressedBeliefMDPs.ExplorationPolicySamplerType
ExplorationPolicySampler

Samples belief states by rolling out an ExplorationPolicy. Essentially identical to PolicySampler.

Fields

  • explorer::ExplorationPolicy: The ExplorationPolicy used for decision making.
  • on_policy::Policy: The fallback Policy used for decision making when not exploring.
  • updater::Updater: The updater used for updating beliefs.
  • n::Integer: The maximum number of simulated steps.
  • rng::AbstractRNG: The random number generator used for sampling.
  • verbose::Bool: Whether to use a progress bar while sampling.

Constructors

ExplorationPolicySampler(pomdp::POMDP; rng::AbstractRNG=Random.GLOBAL_RNG,
 explorer::ExplorationPolicy=EpsGreedyPolicy(pomdp, 0.1; rng=rng), on_policy=RandomPolicy(pomdp),
 updater::Updater=DiscreteUpdater(pomdp), n::Integer=10)

Methods

(s::ExplorationPolicySampler)(pomdp::POMDP)

Returns a vector of unique belief states.

Example Usage

julia> pomdp = TigerPOMDP()
 julia> sampler = ExplorationPolicySampler(pomdp; n=30)
julia> sampler(pomdp)
 3-element Vector{Any}:
  DiscreteBelief{TigerPOMDP, Bool}(TigerPOMDP(-1.0, -100.0, 10.0, 0.85, 0.95), Bool[0, 1], [0.5, 0.5])
  DiscreteBelief{TigerPOMDP, Bool}(TigerPOMDP(-1.0, -100.0, 10.0, 0.85, 0.95), Bool[0, 1], [0.85, 0.15000000000000002])
 DiscreteBelief{TigerPOMDP, Bool}(TigerPOMDP(-1.0, -100.0, 10.0, 0.85, 0.95), Bool[0, 1], [0.9697986577181208, 0.030201342281879207])
source
+ DiscreteBelief{TigerPOMDP, Bool}(TigerPOMDP(-1.0, -100.0, 10.0, 0.85, 0.95), Bool[0, 1], [0.9697986577181208, 0.030201342281879207])source diff --git a/dev/search_index.js b/dev/search_index.js index f5aa49c..a70aaf8 100644 --- a/dev/search_index.js +++ b/dev/search_index.js @@ -1,3 +1,3 @@ var documenterSearchIndex = {"docs": -[{"location":"api/#API-Documentation","page":"API Documentation","title":"API Documentation","text":"","category":"section"},{"location":"api/","page":"API Documentation","title":"API Documentation","text":"CurrentModule = CompressedBeliefMDPs","category":"page"},{"location":"api/#Contents","page":"API Documentation","title":"Contents","text":"","category":"section"},{"location":"api/","page":"API Documentation","title":"API Documentation","text":"Pages = [\"api.md\"]","category":"page"},{"location":"api/#Index","page":"API Documentation","title":"Index","text":"","category":"section"},{"location":"api/","page":"API Documentation","title":"API Documentation","text":"Pages = [\"api.md\"]","category":"page"},{"location":"api/#Types/Functors","page":"API Documentation","title":"Types/Functors","text":"","category":"section"},{"location":"api/","page":"API Documentation","title":"API Documentation","text":"Sampler\nCompressor\nCompressedBeliefMDP\nCompressedBeliefPolicy\nCompressedBeliefSolver","category":"page"},{"location":"api/#CompressedBeliefMDPs.Sampler","page":"API Documentation","title":"CompressedBeliefMDPs.Sampler","text":"Abstract type for an object that defines how the belief should be sampled.\n\n\n\n\n\n","category":"type"},{"location":"api/#CompressedBeliefMDPs.Compressor","page":"API Documentation","title":"CompressedBeliefMDPs.Compressor","text":"Abstract type for an object that defines how the belief should be compressed.\n\n\n\n\n\n","category":"type"},{"location":"api/#CompressedBeliefMDPs.CompressedBeliefMDP","page":"API Documentation","title":"CompressedBeliefMDPs.CompressedBeliefMDP","text":"CompressedBeliefMDP{B, 
A}\n\nThe CompressedBeliefMDP struct is a generalization of the compressed belief-state MDP presented in Exponential Family PCA for Belief Compression in POMDPs.\n\nType Parameters\n\nB: The type of compressed belief states.\nA: The type of actions.\n\nFields\n\nbmdp::GenerativeBeliefMDP: The generative belief-state MDP.\ncompressor::Compressor: The compressor used to compress belief states.\nϕ::Bijection: A bijection representing the mapping from uncompressed belief states to compressed belief states. See notes. \n\nConstructors\n\nCompressedBeliefMDP(pomdp::POMDP, updater::Updater, compressor::Compressor)\nCompressedBeliefMDP(pomdp::POMDP, sampler::Sampler, updater::Updater, compressor::Compressor)\n\nConstructs a CompressedBeliefMDP using the specified POMDP, updater, and compressor.\n\nwarning: Warning\nThe 4-argument constructor is a quality-of-life constructor that calls fit! on the given compressor. \n\nExample Usage\n\npomdp = TigerPOMDP()\nupdater = DiscreteUpdater(pomdp)\ncompressor = PCACompressor(1)\nmdp = CompressedBeliefMDP(pomdp, updater, compressor)\n\nFor continuous POMDPs, see ParticleFilters.jl.\n\nNotes\n\nWhile compressions aren't usually injective, we cache beliefs and their compressions on a first-come, first-served basis, so we can effectively use a bijection without loss of generality.\n\n\n\n\n\n","category":"type"},{"location":"api/#CompressedBeliefMDPs.CompressedBeliefPolicy","page":"API Documentation","title":"CompressedBeliefMDPs.CompressedBeliefPolicy","text":"CompressedBeliefPolicy\n\nMaps a base policy for the compressed belief-state MDP to a policy for the true POMDP.\n\nFields\n\nm::CompressedBeliefMDP: The compressed belief-state MDP.\nbase_policy::Policy: The base policy used for decision-making in the compressed belief-state MDP.\n\nConstructors\n\nCompressedBeliefPolicy(m::CompressedBeliefMDP, base_policy::Policy)\n\nConstructs a CompressedBeliefPolicy using the specified compressed belief-state MDP and base policy.\n\nExample 
Usage\n\npolicy = solve(solver, pomdp)\ns = initialstate(pomdp)\na = action(policy, s) # returns the approximately optimal action for state s\nv = value(policy, s) # returns the approximately optimal value for state s\n\n\n\n\n\n","category":"type"},{"location":"api/#CompressedBeliefMDPs.CompressedBeliefSolver","page":"API Documentation","title":"CompressedBeliefMDPs.CompressedBeliefSolver","text":"CompressedBeliefSolver\n\nThe CompressedBeliefSolver struct represents a solver for compressed belief-state MDPs. It combines a compressed belief-state MDP with a base solver to approximate the value function.\n\nFields\n\nm::CompressedBeliefMDP: The compressed belief-state MDP.\nbase_solver::Solver: The base solver used to solve the compressed belief-state MDP.\n\nConstructors\n\nCompressedBeliefSolver(pomdp::POMDP, base_solver::Solver; updater::Updater=DiscreteUpdater(pomdp), sampler::Sampler=BeliefExpansionSampler(pomdp), compressor::Compressor=PCACompressor(1))\nCompressedBeliefSolver(pomdp::POMDP; updater::Updater=DiscreteUpdater(pomdp), sampler::Sampler=BeliefExpansionSampler(pomdp), compressor::Compressor=PCACompressor(1), interp::Union{Nothing, LocalFunctionApproximator}=nothing, k::Int=1, verbose::Bool=false, max_iterations::Int=1000, n_generative_samples::Int=10, belres::Float64=1e-3)\n\nConstructs a CompressedBeliefSolver using the specified POMDP, base solver, updater, sampler, and compressor. Alternatively, you can omit the base solver in which case a LocalApproximationValueIterationSolver(https://github.com/JuliaPOMDP/LocalApproximationValueIteration.jl) will be created instead. 
For example, different base solvers are needed if the POMDP state and action space are continuous.\n\nExample Usage\n\njulia> pomdp = TigerPOMDP();\njulia> solver = CompressedBeliefSolver(pomdp; verbose=true, max_iterations=10);\njulia> solve(solver, pomdp);\n[Iteration 1 ] residual: 8.51 | iteration runtime: 635.870 ms, ( 0.636 s total)\n[Iteration 2 ] residual: 3.63 | iteration runtime: 0.504 ms, ( 0.636 s total)\n[Iteration 3 ] residual: 10.1 | iteration runtime: 0.445 ms, ( 0.637 s total)\n[Iteration 4 ] residual: 15.2 | iteration runtime: 0.494 ms, ( 0.637 s total)\n[Iteration 5 ] residual: 6.72 | iteration runtime: 0.432 ms, ( 0.638 s total)\n[Iteration 6 ] residual: 7.38 | iteration runtime: 0.508 ms, ( 0.638 s total)\n[Iteration 7 ] residual: 6.03 | iteration runtime: 0.495 ms, ( 0.639 s total)\n[Iteration 8 ] residual: 5.73 | iteration runtime: 0.585 ms, ( 0.639 s total)\n[Iteration 9 ] residual: 4.02 | iteration runtime: 0.463 ms, ( 0.64 s total)\n[Iteration 10 ] residual: 7.28 | iteration runtime: 0.576 ms, ( 0.64 s total)\n\n\n\n\n\n","category":"type"},{"location":"api/#Functions","page":"API Documentation","title":"Functions","text":"","category":"section"},{"location":"api/","page":"API Documentation","title":"API Documentation","text":"fit!\nmake_cache\nmake_numerical\ncompress_POMDP","category":"page"},{"location":"api/#CompressedBeliefMDPs.fit!","page":"API Documentation","title":"CompressedBeliefMDPs.fit!","text":"fit!(compressor::Compressor, beliefs)\n\nFit the compressor to beliefs.\n\n\n\n\n\n","category":"function"},{"location":"api/#CompressedBeliefMDPs.make_cache","page":"API Documentation","title":"CompressedBeliefMDPs.make_cache","text":"make_cache(B, B̃)\n\nHelper function that creates a cache that maps each unique belief from the set B to its corresponding compressed representation in B̃.\n\nArguments\n\nB::Vector{<:Any}: A vector of beliefs.\nB̃::Matrix{Float64}: A matrix where each row corresponds to the compressed representation of 
the beliefs in B.\n\nReturns\n\nDict{<:Any, Vector{Float64}}: A dictionary mapping each unique belief in B to its corresponding compressed representation in B̃.\n\nExample Usage\n\nB = [belief1, belief2, belief3]\nB̃ = [compressed_belief1; compressed_belief2; compressed_belief3]\nϕ = make_cache(B, B̃)\n\n\n\n\n\n","category":"function"},{"location":"api/#CompressedBeliefMDPs.make_numerical","page":"API Documentation","title":"CompressedBeliefMDPs.make_numerical","text":"make_numerical(B, pomdp)\n\nHelper function that converts a set of beliefs B into a numerical matrix representation suitable for processing by numerical algorithms/compressors.\n\nArguments\n\nB::Vector{<:Any}: A vector of beliefs.\npomdp::POMDP: The POMDP model associated with the beliefs.\n\nReturns\n\nMatrix{Float64}: A matrix where each row corresponds to a numerical representation of a belief in B.\n\nExample Usage\n\nB = [belief1, belief2, belief3]\nB_numerical = make_numerical(B, pomdp)\n\n\n\n\n\n","category":"function"},{"location":"api/#CompressedBeliefMDPs.compress_POMDP","page":"API Documentation","title":"CompressedBeliefMDPs.compress_POMDP","text":"compress_POMDP(pomdp, sampler, updater, compressor)\n\nCreates a compressed belief-state MDP by sampling, compressing, and caching beliefs from the given POMDP.\n\nArguments\n\npomdp::POMDP: The POMDP model to be compressed.\nsampler::Sampler: A sampler to generate a set of beliefs from the POMDP.\nupdater::Updater: An updater to initialize beliefs from states.\ncompressor::Compressor: A compressor to reduce the dimensionality of the beliefs.\n\nReturns\n\nCompressedBeliefMDP: The constructed compressed belief-state MDP.\nMatrix{Float64}: A matrix where each row corresponds to the compressed representation of the sampled beliefs.\n\nExample Usage\n\npomdp = TigerPOMDP()\nsampler = BeliefExpansionSampler(pomdp)\nupdater = DiscreteUpdater(pomdp)\ncompressor = PCACompressor(2)\nm, B̃ = compress_POMDP(pomdp, sampler, updater, 
compressor)\n\n\n\n\n\n","category":"function"},{"location":"compressors/#Compressors","page":"Compressors","title":"Compressors","text":"","category":"section"},{"location":"compressors/#Defining-a-Belief-Compressor","page":"Compressors","title":"Defining a Belief Compressor","text":"","category":"section"},{"location":"compressors/","page":"Compressors","title":"Compressors","text":"In this section, we outline the requirements and guidelines for defining a belief Compressor.","category":"page"},{"location":"compressors/#Interface","page":"Compressors","title":"Interface","text":"","category":"section"},{"location":"compressors/","page":"Compressors","title":"Compressors","text":"The Compressor interface is extremely minimal. It only supports two methods: fit! and the associated functor. For example, if you wanted to implement your own Compressor, you could write something like this:","category":"page"},{"location":"compressors/","page":"Compressors","title":"Compressors","text":"struct MyCompressor <: Compressor\n foo\n bar\nend\n\n# functor definition\nfunction (c::MyCompressor)(beliefs)\n # YOUR CODE HERE\n return compressed_beliefs\nend\n\nfunction fit!(c::MyCompressor, beliefs)\n # YOUR CODE HERE\nend","category":"page"},{"location":"compressors/#Implementation-Tips","page":"Compressors","title":"Implementation Tips","text":"","category":"section"},{"location":"compressors/","page":"Compressors","title":"Compressors","text":"For robustness, both the functor and fit! should be able to handle AbstractVector and AbstractMatrix inputs. \nfit! is called only once after beliefs are sampled from the POMDP.\nCompressedBeliefSolver will attempt to convert each belief state (often of type DiscreteBelief) into an AbstractArray{Float64} using convert_s. As a convenience, CompressedBeliefMDP implements conversions for commonly used belief types; however, if the POMDP has a custom belief state, then it is the user's responsibility to implement the appropriate conversion. 
See the source code for help. ","category":"page"},{"location":"compressors/#Implemented-Compressors","page":"Compressors","title":"Implemented Compressors","text":"","category":"section"},{"location":"compressors/","page":"Compressors","title":"Compressors","text":"CompressedBeliefMDPs currently provides wrappers for the following compression types:","category":"page"},{"location":"compressors/","page":"Compressors","title":"Compressors","text":"a principal component analysis (PCA) compressor,\na kernel PCA compressor,\na probabilistic PCA compressor,\na factor analysis compressor,\nan isomap compressor,\nan autoencoder compressor,\na variational auto-encoder (VAE) compressor","category":"page"},{"location":"compressors/#Principal-Component-Analysis-(PCA)","page":"Compressors","title":"Principal Component Analysis (PCA)","text":"","category":"section"},{"location":"compressors/","page":"Compressors","title":"Compressors","text":"PCACompressor","category":"page"},{"location":"compressors/#CompressedBeliefMDPs.PCACompressor","page":"Compressors","title":"CompressedBeliefMDPs.PCACompressor","text":"Wrapper for MultivariateStats.PCA.\n\n\n\n\n\n","category":"function"},{"location":"compressors/#Kernel-PCA","page":"Compressors","title":"Kernel PCA","text":"","category":"section"},{"location":"compressors/","page":"Compressors","title":"Compressors","text":"KernelPCACompressor","category":"page"},{"location":"compressors/#CompressedBeliefMDPs.KernelPCACompressor","page":"Compressors","title":"CompressedBeliefMDPs.KernelPCACompressor","text":"Wrapper for MultivariateStats.KernelPCA.\n\n\n\n\n\n","category":"function"},{"location":"compressors/#Probabilistic-PCA","page":"Compressors","title":"Probabilistic 
PCA","text":"","category":"section"},{"location":"compressors/","page":"Compressors","title":"Compressors","text":"PPCACompressor","category":"page"},{"location":"compressors/#CompressedBeliefMDPs.PPCACompressor","page":"Compressors","title":"CompressedBeliefMDPs.PPCACompressor","text":"Wrapper for MultivariateStats.PPCA.\n\n\n\n\n\n","category":"function"},{"location":"compressors/#Factor-Analysis","page":"Compressors","title":"Factor Analysis","text":"","category":"section"},{"location":"compressors/","page":"Compressors","title":"Compressors","text":"FactorAnalysisCompressor","category":"page"},{"location":"compressors/#CompressedBeliefMDPs.FactorAnalysisCompressor","page":"Compressors","title":"CompressedBeliefMDPs.FactorAnalysisCompressor","text":"Wrapper for MultivariateStats.FactorAnalysis\n\n\n\n\n\n","category":"function"},{"location":"compressors/#Isomap","page":"Compressors","title":"Isomap","text":"","category":"section"},{"location":"compressors/","page":"Compressors","title":"Compressors","text":"IsomapCompressor","category":"page"},{"location":"compressors/#CompressedBeliefMDPs.IsomapCompressor","page":"Compressors","title":"CompressedBeliefMDPs.IsomapCompressor","text":"Wrapper for ManifoldLearning.Isomap.\n\n\n\n\n\n","category":"function"},{"location":"compressors/#Autoencoder","page":"Compressors","title":"Autoencoder","text":"","category":"section"},{"location":"compressors/","page":"Compressors","title":"Compressors","text":"AutoencoderCompressor","category":"page"},{"location":"compressors/#CompressedBeliefMDPs.AutoencoderCompressor","page":"Compressors","title":"CompressedBeliefMDPs.AutoencoderCompressor","text":"Implements an autoencoder in Flux.\n\n\n\n\n\n","category":"type"},{"location":"compressors/#Variational-Auto-Encoder-(VAE)","page":"Compressors","title":"Variational Auto-Encoder 
(VAE)","text":"","category":"section"},{"location":"compressors/","page":"Compressors","title":"Compressors","text":"VAECompressor","category":"page"},{"location":"compressors/#CompressedBeliefMDPs.VAECompressor","page":"Compressors","title":"CompressedBeliefMDPs.VAECompressor","text":"Implements a VAE in Flux.\n\n\n\n\n\n","category":"type"},{"location":"compressors/","page":"Compressors","title":"Compressors","text":"warning: Warning\nSome compression algorithms aren't optimized for large belief spaces. While they pass our unit tests, they may fail on large POMDPs or without seeding. For large POMDPs, users may want a custom Compressor.","category":"page"},{"location":"circular/#Circular-Maze","page":"Environments","title":"Circular Maze","text":"","category":"section"},{"location":"circular/#Description","page":"Environments","title":"Description","text":"","category":"section"},{"location":"circular/","page":"Environments","title":"Environments","text":"This environment is a generalization of the Circular Maze POMDP described in Finding Approximate POMDP solutions Through Belief Compression.[1] The world consists of n_corridors 1D circular corridors that each have corridor_length states. The robot spawns in a random corridor. It must determine which corridor it's in, navigate to the proper goal state, and finally declare that it has finished.","category":"page"},{"location":"circular/","page":"Environments","title":"Environments","text":"(Image: )","category":"page"},{"location":"circular/","page":"Environments","title":"Environments","text":"Figure from Finding Approximate POMDP solutions Through Belief Compression.","category":"page"},{"location":"circular/","page":"Environments","title":"Environments","text":"[1]: Roy doesn't actually name his toy environment. 
For the original environment details, see the \"PCA Performance\" subsection on page 8.","category":"page"},{"location":"circular/#Action-Space","page":"Environments","title":"Action Space","text":"","category":"section"},{"location":"circular/","page":"Environments","title":"Environments","text":"Transitions left and right are noisy and non-deterministic. Transition probabilities are from a discrete von Mises distribution with unit concentration and mean at the target state. ","category":"page"},{"location":"circular/","page":"Environments","title":"Environments","text":"Num Action Description\n1 CMAZE_LEFT Move left with von Mises noise.\n2 CMAZE_RIGHT Move right with von Mises noise.\n3 CMAZE_SENSE_CORRIDOR Observe the current corridor.\n4 CMAZE_DECLARE_GOAL Ends the episode. Receive r_findgoal if at the goal.","category":"page"},{"location":"circular/#State-Space","page":"Environments","title":"State Space","text":"","category":"section"},{"location":"circular/","page":"Environments","title":"Environments","text":"The (ordered) state space is an array of all CircularMazeStates and a terminalstate: [CircularMazeState(1, 1), ..., CircularMazeState(n_corridors, corridor_length), TerminalState()].","category":"page"},{"location":"circular/#Observation-Space","page":"Environments","title":"Observation Space","text":"","category":"section"},{"location":"circular/","page":"Environments","title":"Environments","text":"The observation space is the union of the state space and 1:n_corridors. If the robot picks CMAZE_SENSE_CORRIDOR, it observes the index of the current corridor. Otherwise, it observes its current state with von Mises noise.","category":"page"},{"location":"circular/#Rewards","page":"Environments","title":"Rewards","text":"","category":"section"},{"location":"circular/","page":"Environments","title":"Environments","text":"The goal is to navigate to the correct goal state for the given corridor and then to declare the goal once arrived. 
If the robot correctly declares the goal, it receives r_findgoal. It incurs an r_timestep_penalty for every timestep it has not reached the goal. By default, r_findgoal is 1 and r_timestep_penalty is 0. ","category":"page"},{"location":"circular/#Starting-State","page":"Environments","title":"Starting State","text":"","category":"section"},{"location":"circular/","page":"Environments","title":"Environments","text":"The initial state is sampled from repeated, discrete von Mises distributions, each with its concentration at the center of the hallway. ","category":"page"},{"location":"circular/","page":"Environments","title":"Environments","text":"(Image: )","category":"page"},{"location":"circular/#Episode-End","page":"Environments","title":"Episode End","text":"","category":"section"},{"location":"circular/","page":"Environments","title":"Environments","text":"The episode terminates once the robot declares the goal (CMAZE_DECLARE_GOAL), regardless of whether the robot is correct.","category":"page"},{"location":"circular/#Documentation","page":"Environments","title":"Documentation","text":"","category":"section"},{"location":"circular/","page":"Environments","title":"Environments","text":"CircularMaze","category":"page"},{"location":"circular/#CompressedBeliefMDPs.CircularMaze","page":"Environments","title":"CompressedBeliefMDPs.CircularMaze","text":"CircularMaze(n_corridors::Integer, corridor_length::Integer, discount::Float64, r_findgoal::Float64, r_timestep_penalty::Float64)\nCircularMaze(n_corridors::Integer, corridor_length::Integer; kwargs...)\nCircularMaze()\n\nA POMDP representing a circular maze environment.\n\nFields\n\nn_corridors::Integer: Number of corridors in the circular maze.\ncorridor_length::Integer: Length of each corridor.\nprobabilities::AbstractArray: Probability masses for creating von Mises distributions.\ncenter::Integer: The central position in the maze.\ndiscount::Float64: Discount factor for future rewards.\nr_findgoal::Float64: Reward for 
finding the goal.\nr_timestep_penalty::Float64: Penalty for each timestep taken.\nstates::AbstractArray: Array of all possible states in the maze.\ngoals::AbstractArray: Array of goal states in the maze.\n\nExample\n\nusing CompressedBeliefMDPs\n\nn_corridors = 8\ncorridor_length = 25\nmaze = CircularMaze(n_corridors, corridor_length)\n\n\n\n\n\n","category":"type"},{"location":"circular/","page":"Environments","title":"Environments","text":"CircularMazeState","category":"page"},{"location":"circular/#CompressedBeliefMDPs.CircularMazeState","page":"Environments","title":"CompressedBeliefMDPs.CircularMazeState","text":"CircularMazeState(corridor::Integer, x::Integer)\n\nThe CircularMazeState struct represents the state of an agent in a circular maze.\n\nFields\n\ncorridor::Integer: The corridor number. The value ranges from 1 to n_corridors.\nx::Integer: The position of the state within the corridor. The value ranges from 1 to the corridor_length.\n\n\n\n\n\n","category":"type"},{"location":"#CompressedBeliefMDPs.jl","page":"CompressedBeliefMDPs.jl","title":"CompressedBeliefMDPs.jl","text":"","category":"section"},{"location":"#Introduction","page":"CompressedBeliefMDPs.jl","title":"Introduction","text":"","category":"section"},{"location":"","page":"CompressedBeliefMDPs.jl","title":"CompressedBeliefMDPs.jl","text":"Welcome to CompressedBeliefMDPs.jl! This package is part of the POMDPs.jl ecosystem and takes inspiration from Exponential Family PCA for Belief Compression in POMDPs. 
","category":"page"},{"location":"","page":"CompressedBeliefMDPs.jl","title":"CompressedBeliefMDPs.jl","text":"This package provides a general framework for applying belief compression in large POMDPs with generic compression, sampling, and planning algorithms.","category":"page"},{"location":"#Installation","page":"CompressedBeliefMDPs.jl","title":"Installation","text":"","category":"section"},{"location":"","page":"CompressedBeliefMDPs.jl","title":"CompressedBeliefMDPs.jl","text":"You can install CompressedBeliefMDPs.jl using Julia's package manager. Open the Julia REPL (press ] to enter the package manager mode) and run the following command:","category":"page"},{"location":"","page":"CompressedBeliefMDPs.jl","title":"CompressedBeliefMDPs.jl","text":"pkg> add CompressedBeliefMDPs","category":"page"},{"location":"#Quickstart","page":"CompressedBeliefMDPs.jl","title":"Quickstart","text":"","category":"section"},{"location":"","page":"CompressedBeliefMDPs.jl","title":"CompressedBeliefMDPs.jl","text":"Using belief compression is easy. 
Simply pick a Sampler, a Compressor, and a base Policy, and then use the standard POMDPs.jl interface.","category":"page"},{"location":"","page":"CompressedBeliefMDPs.jl","title":"CompressedBeliefMDPs.jl","text":"using POMDPs, POMDPTools, POMDPModels\nusing CompressedBeliefMDPs\n\npomdp = BabyPOMDP()\ncompressor = PCACompressor(1)\nupdater = DiscreteUpdater(pomdp)\nsampler = BeliefExpansionSampler(pomdp)\nsolver = CompressedBeliefSolver(\n pomdp;\n compressor=compressor,\n sampler=sampler,\n updater=updater,\n verbose=true, \n max_iterations=100, \n n_generative_samples=50, \n k=2\n)\npolicy = solve(solver, pomdp)","category":"page"},{"location":"#Continuous-Example","page":"CompressedBeliefMDPs.jl","title":"Continuous Example","text":"","category":"section"},{"location":"","page":"CompressedBeliefMDPs.jl","title":"CompressedBeliefMDPs.jl","text":"This example demonstrates using CompressedBeliefMDP in a continuous setting with the LightDark1D POMDP. It combines particle filters for belief updating and Monte Carlo Tree Search (MCTS) as the solver. 
While compressing a 1D space is a trivial toy problem, this architecture can be easily scaled to larger POMDPs with continuous state and action spaces.","category":"page"},{"location":"","page":"CompressedBeliefMDPs.jl","title":"CompressedBeliefMDPs.jl","text":"using POMDPs, POMDPModels, POMDPTools\nusing ParticleFilters\nusing MCTS\nusing CompressedBeliefMDPs\n\npomdp = LightDark1D()\npomdp.movement_cost = 1\nbase_solver = MCTSSolver(n_iterations=10, depth=50, exploration_constant=5.0)\nupdater = BootstrapFilter(pomdp, 100)\nsolver = CompressedBeliefSolver(\n pomdp,\n base_solver;\n updater=updater,\n sampler=PolicySampler(pomdp; updater=updater)\n)\npolicy = solve(solver, pomdp)\nrs = RolloutSimulator(max_steps=50)\nr = simulate(rs, pomdp, policy)","category":"page"},{"location":"","page":"CompressedBeliefMDPs.jl","title":"CompressedBeliefMDPs.jl","text":"Note: We use MCTS here as a proof of concept that CompressedBeliefMDPs can handle continuous state and action spaces. In reality, belief compression has no effect on MCTS with double progressive widening. If you want to solve continuous POMDPs, we suggest implementing a custom solver or looking into Crux.jl.","category":"page"},{"location":"#Large-Example","page":"CompressedBeliefMDPs.jl","title":"Large Example","text":"","category":"section"},{"location":"","page":"CompressedBeliefMDPs.jl","title":"CompressedBeliefMDPs.jl","text":"In this example, we tackle a more realistic scenario with the TMaze POMDP, which has 123 states. To handle the larger state space efficiently, we employ a variational auto-encoder (VAE) to compress the belief simplex. 
By leveraging the VAE's ability to learn a compact representation of the belief state, we focus computational power on the relevant compressed belief states during each Bellman update.","category":"page"},{"location":"","page":"CompressedBeliefMDPs.jl","title":"CompressedBeliefMDPs.jl","text":"using POMDPs, POMDPModels, POMDPTools\nusing CompressedBeliefMDPs\n\npomdp = TMaze(60, 0.9)\nsolver = CompressedBeliefSolver(\n pomdp;\n compressor=VAECompressor(123, 6; hidden_dim=10, verbose=true, epochs=2),\n sampler=PolicySampler(pomdp, n=500),\n verbose=true, \n max_iterations=1000, \n n_generative_samples=30,\n k=2\n)\npolicy = solve(solver, pomdp)\nrs = RolloutSimulator(max_steps=50)\nr = simulate(rs, pomdp, policy)","category":"page"},{"location":"#Concepts-and-Architecture","page":"CompressedBeliefMDPs.jl","title":"Concepts and Architecture","text":"","category":"section"},{"location":"","page":"CompressedBeliefMDPs.jl","title":"CompressedBeliefMDPs.jl","text":"CompressedBeliefMDPs.jl aims to implement a generalization of the belief compression algorithm for solving large POMDPs. 
The algorithm has four steps:","category":"page"},{"location":"","page":"CompressedBeliefMDPs.jl","title":"CompressedBeliefMDPs.jl","text":"collect belief samples,\ncompress the samples,\ncreate the compressed belief-state MDP,\nsolve the MDP.","category":"page"},{"location":"","page":"CompressedBeliefMDPs.jl","title":"CompressedBeliefMDPs.jl","text":"Each step is handled by Sampler, Compressor, CompressedBeliefMDP, and CompressedBeliefSolver respectively.","category":"page"},{"location":"","page":"CompressedBeliefMDPs.jl","title":"CompressedBeliefMDPs.jl","text":"For more details, please see the rest of the documentation or the associated paper.","category":"page"},{"location":"samplers/#Samplers","page":"Samplers","title":"Samplers","text":"","category":"section"},{"location":"samplers/#Defining-a-Sampler","page":"Samplers","title":"Defining a Sampler","text":"","category":"section"},{"location":"samplers/","page":"Samplers","title":"Samplers","text":"In this section, we outline the requirements and guidelines for defining a belief Sampler.","category":"page"},{"location":"samplers/#Interface","page":"Samplers","title":"Interface","text":"","category":"section"},{"location":"samplers/","page":"Samplers","title":"Samplers","text":"The Sampler interface only has one method: the functor. 
For example, if you wanted to implement your own Sampler, you could write something like this:","category":"page"},{"location":"samplers/","page":"Samplers","title":"Samplers","text":"struct MySampler <: Sampler\n foo\n bar\nend\n\n# functor definition\nfunction (c::MySampler)(pomdp::POMDP)\n # YOUR CODE HERE\n return sampled_beliefs\nend","category":"page"},{"location":"samplers/#Implemented-Sampler","page":"Samplers","title":"Implemented Sampler","text":"","category":"section"},{"location":"samplers/","page":"Samplers","title":"Samplers","text":"CompressedBeliefMDPs provides the following generic belief samplers:","category":"page"},{"location":"samplers/","page":"Samplers","title":"Samplers","text":"an exploratory belief expansion sampler\na Policy rollout sampler\nan ExplorationPolicy rollout sampler","category":"page"},{"location":"samplers/#Exploratory-Belief-Expansion","page":"Samplers","title":"Exploratory Belief Expansion","text":"","category":"section"},{"location":"samplers/","page":"Samplers","title":"Samplers","text":"BeliefExpansionSampler","category":"page"},{"location":"samplers/#CompressedBeliefMDPs.BeliefExpansionSampler","page":"Samplers","title":"CompressedBeliefMDPs.BeliefExpansionSampler","text":"BeliefExpansionSampler\n\nFast extension of exploratory belief expansion (Algorithm 21.13 in Algorithms for Decision Making) that uses k-d trees.\n\nFields\n\nupdater::Updater: The updater used to update beliefs.\nmetric::NearestNeighbors.MinkowskiMetric: The metric used to measure distances between beliefs. It must be a Minkowski metric.\nn::Integer: The number of belief expansions to perform.\n\nConstructors\n\nBeliefExpansionSampler(pomdp::POMDP; updater::Updater=DiscreteUpdater(pomdp),\nmetric::NearestNeighbors.MinkowskiMetric=Euclidean(), n::Integer=3)\n\nMethods\n\n(s::BeliefExpansionSampler)(pomdp::POMDP)\n\nCreates an initial belief and performs exploratory belief expansion. Returns the unique belief states. 
Only works for POMDPs with discrete state, action, and observation spaces.\n\nExample Usage\n\njulia> pomdp = TigerPOMDP();\njulia> sampler = BeliefExpansionSampler(pomdp; n=2);\njulia> beliefs = sampler(pomdp)\nSet{DiscreteBelief{TigerPOMDP, Bool}} with 4 elements:\n  DiscreteBelief{TigerPOMDP, Bool}(TigerPOMDP(-1.0, -100.0, 10.0, 0.85, 0.95), Bool[0, 1], [0.15000000000000002, 0.85])\n  DiscreteBelief{TigerPOMDP, Bool}(TigerPOMDP(-1.0, -100.0, 10.0, 0.85, 0.95), Bool[0, 1], [0.5, 0.5])\n  DiscreteBelief{TigerPOMDP, Bool}(TigerPOMDP(-1.0, -100.0, 10.0, 0.85, 0.95), Bool[0, 1], [0.85, 0.15000000000000002])\n  DiscreteBelief{TigerPOMDP, Bool}(TigerPOMDP(-1.0, -100.0, 10.0, 0.85, 0.95), Bool[0, 1], [0.9697986577181208, 0.030201342281879207])\n\n\n\n\n\n","category":"type"},{"location":"samplers/#Policy-Sampler","page":"Samplers","title":"Policy Sampler","text":"","category":"section"},{"location":"samplers/","page":"Samplers","title":"Samplers","text":"PolicySampler","category":"page"},{"location":"samplers/#CompressedBeliefMDPs.PolicySampler","page":"Samplers","title":"CompressedBeliefMDPs.PolicySampler","text":"PolicySampler\n\nSamples belief states by rolling out a Policy.\n\nFields\n\npolicy::Policy: The policy used for decision making.\nupdater::Updater: The updater used for updating beliefs.\nn::Integer: The maximum number of simulated steps.\nrng::AbstractRNG: The random number generator used for sampling.\nverbose::Bool: Whether to use a progress bar while sampling.\n\nConstructors\n\nPolicySampler(pomdp::POMDP; policy::Policy=RandomPolicy(pomdp), \nupdater::Updater=DiscreteUpdater(pomdp), n::Integer=10, \nrng::AbstractRNG=Random.GLOBAL_RNG)\n\nMethods\n\n(s::PolicySampler)(pomdp::POMDP)\n\nReturns a vector of unique belief states.\n\nExample\n\njulia> pomdp = TigerPOMDP();\njulia> sampler = PolicySampler(pomdp; n=3);\njulia> sampler(pomdp)\n2-element Vector{Any}:\nDiscreteBelief{TigerPOMDP, Bool}(TigerPOMDP(-1.0, -100.0, 10.0, 0.85, 0.95), Bool[0, 1], [0.5, 
0.5])\nDiscreteBelief{TigerPOMDP, Bool}(TigerPOMDP(-1.0, -100.0, 10.0, 0.85, 0.95), Bool[0, 1], [0.15000000000000002, 0.85])\n\n\n\n\n\n","category":"type"},{"location":"samplers/#ExplorationPolicy-Sampler","page":"Samplers","title":"ExplorationPolicy Sampler","text":"","category":"section"},{"location":"samplers/","page":"Samplers","title":"Samplers","text":"ExplorationPolicySampler","category":"page"},{"location":"samplers/#CompressedBeliefMDPs.ExplorationPolicySampler","page":"Samplers","title":"CompressedBeliefMDPs.ExplorationPolicySampler","text":"ExplorationPolicySampler\n\nSamples belief states by rolling out an ExplorationPolicy. Essentially identical to PolicySampler.\n\nFields\n\nexplorer::ExplorationPolicy: The ExplorationPolicy used for decision making.\non_policy::Policy: The fallback Policy used for decision making when not exploring.\nupdater::Updater: The updater used for updating beliefs.\nn::Integer: The maximum number of simulated steps.\nrng::AbstractRNG: The random number generator used for sampling.\nverbose::Bool: Whether to use a progress bar while sampling.\n\nConstructors\n\nExplorationPolicySampler(pomdp::POMDP; rng::AbstractRNG=Random.GLOBAL_RNG,\nexplorer::ExplorationPolicy=EpsGreedyPolicy(pomdp, 0.1; rng=rng), on_policy=RandomPolicy(pomdp),\nupdater::Updater=DiscreteUpdater(pomdp), n::Integer=10)\n\nMethods\n\n(s::ExplorationPolicySampler)(pomdp::POMDP)\n\nReturns a vector of unique belief states.\n\nExample Usage\n\njulia> pomdp = TigerPOMDP()\njulia> sampler = ExplorationPolicySampler(pomdp; n=30)\njulia> sampler(pomdp)\n3-element Vector{Any}:\n DiscreteBelief{TigerPOMDP, Bool}(TigerPOMDP(-1.0, -100.0, 10.0, 0.85, 0.95), Bool[0, 1], [0.5, 0.5])\n DiscreteBelief{TigerPOMDP, Bool}(TigerPOMDP(-1.0, -100.0, 10.0, 0.85, 0.95), Bool[0, 1], [0.85, 0.15000000000000002])\n DiscreteBelief{TigerPOMDP, Bool}(TigerPOMDP(-1.0, -100.0, 10.0, 0.85, 0.95), Bool[0, 1], [0.9697986577181208, 0.030201342281879207])\n\n\n\n\n\n","category":"type"}] 
+[{"location":"api/#API-Documentation","page":"API Documentation","title":"API Documentation","text":"","category":"section"},{"location":"api/","page":"API Documentation","title":"API Documentation","text":"CurrentModule = CompressedBeliefMDPs","category":"page"},{"location":"api/#Contents","page":"API Documentation","title":"Contents","text":"","category":"section"},{"location":"api/","page":"API Documentation","title":"API Documentation","text":"Pages = [\"api.md\"]","category":"page"},{"location":"api/#Index","page":"API Documentation","title":"Index","text":"","category":"section"},{"location":"api/","page":"API Documentation","title":"API Documentation","text":"Pages = [\"api.md\"]","category":"page"},{"location":"api/#Types/Functors","page":"API Documentation","title":"Types/Functors","text":"","category":"section"},{"location":"api/","page":"API Documentation","title":"API Documentation","text":"Sampler\nCompressor\nCompressedBeliefMDP\nCompressedBeliefPolicy\nCompressedBeliefSolver","category":"page"},{"location":"api/#CompressedBeliefMDPs.Sampler","page":"API Documentation","title":"CompressedBeliefMDPs.Sampler","text":"Abstract type for an object that defines how the belief should be sampled.\n\n\n\n\n\n","category":"type"},{"location":"api/#CompressedBeliefMDPs.Compressor","page":"API Documentation","title":"CompressedBeliefMDPs.Compressor","text":"Abstract type for an object that defines how the belief should be compressed.\n\n\n\n\n\n","category":"type"},{"location":"api/#CompressedBeliefMDPs.CompressedBeliefMDP","page":"API Documentation","title":"CompressedBeliefMDPs.CompressedBeliefMDP","text":"CompressedBeliefMDP{B, A}\n\nThe CompressedBeliefMDP struct is a generalization of the compressed belief-state MDP presented in Exponential Family PCA for Belief Compression in POMDPs.\n\nType Parameters\n\nB: The type of compressed belief states.\nA: The type of actions.\n\nFields\n\nbmdp::GenerativeBeliefMDP: The generative belief-state 
MDP.\ncompressor::Compressor: The compressor used to compress belief states.\nϕ::Bijection: A bijection representing the mapping from uncompressed belief states to compressed belief states. See notes. \n\nConstructors\n\nCompressedBeliefMDP(pomdp::POMDP, updater::Updater, compressor::Compressor)\nCompressedBeliefMDP(pomdp::POMDP, sampler::Sampler, updater::Updater, compressor::Compressor)\n\nConstructs a CompressedBeliefMDP using the specified POMDP, updater, and compressor.\n\nwarning: Warning\nThe 4-argument constructor is a quality-of-life constructor that calls fit! on the given compressor. \n\nExample Usage\n\npomdp = TigerPOMDP()\nupdater = DiscreteUpdater(pomdp)\ncompressor = PCACompressor(1)\nmdp = CompressedBeliefMDP(pomdp, updater, compressor)\n\nFor continuous POMDPs, see ParticleFilters.jl.\n\nNotes\n\nWhile compressions aren't usually injective, we cache beliefs and their compressions on a first-come, first-served basis, so we can effectively use a bijection without loss of generality.\n\n\n\n\n\n","category":"type"},{"location":"api/#CompressedBeliefMDPs.CompressedBeliefPolicy","page":"API Documentation","title":"CompressedBeliefMDPs.CompressedBeliefPolicy","text":"CompressedBeliefPolicy\n\nMaps a base policy for the compressed belief-state MDP to a policy for the true POMDP.\n\nFields\n\nm::CompressedBeliefMDP: The compressed belief-state MDP.\nbase_policy::Policy: The base policy used for decision-making in the compressed belief-state MDP.\n\nConstructors\n\nCompressedBeliefPolicy(m::CompressedBeliefMDP, base_policy::Policy)\n\nConstructs a CompressedBeliefPolicy using the specified compressed belief-state MDP and base policy.\n\nExample Usage\n\npolicy = solve(solver, pomdp)\ns = initialstate(pomdp)\na = action(policy, s) # returns the approximately optimal action for state s\nv = value(policy, s) # returns the approximately optimal value for state 
s\n\n\n\n\n\n","category":"type"},{"location":"api/#CompressedBeliefMDPs.CompressedBeliefSolver","page":"API Documentation","title":"CompressedBeliefMDPs.CompressedBeliefSolver","text":"CompressedBeliefSolver\n\nThe CompressedBeliefSolver struct represents a solver for compressed belief-state MDPs. It combines a compressed belief-state MDP with a base solver to approximate the value function.\n\nFields\n\nm::CompressedBeliefMDP: The compressed belief-state MDP.\nbase_solver::Solver: The base solver used to solve the compressed belief-state MDP.\n\nConstructors\n\nCompressedBeliefSolver(pomdp::POMDP, base_solver::Solver; updater::Updater=DiscreteUpdater(pomdp), sampler::Sampler=BeliefExpansionSampler(pomdp), compressor::Compressor=PCACompressor(1))\nCompressedBeliefSolver(pomdp::POMDP; updater::Updater=DiscreteUpdater(pomdp), sampler::Sampler=BeliefExpansionSampler(pomdp), compressor::Compressor=PCACompressor(1), interp::Union{Nothing, LocalFunctionApproximator}=nothing, k::Int=1, verbose::Bool=false, max_iterations::Int=1000, n_generative_samples::Int=10, belres::Float64=1e-3)\n\nConstructs a CompressedBeliefSolver using the specified POMDP, base solver, updater, sampler, and compressor. Alternatively, you can omit the base solver, in which case a LocalApproximationValueIterationSolver (https://github.com/JuliaPOMDP/LocalApproximationValueIteration.jl) will be created instead. 
For example, different base solvers are needed if the POMDP state and action space are continuous.\n\nExample Usage\n\njulia> pomdp = TigerPOMDP();\njulia> solver = CompressedBeliefSolver(pomdp; verbose=true, max_iterations=10);\njulia> solve(solver, pomdp);\n[Iteration 1 ] residual: 8.51 | iteration runtime: 635.870 ms, ( 0.636 s total)\n[Iteration 2 ] residual: 3.63 | iteration runtime: 0.504 ms, ( 0.636 s total)\n[Iteration 3 ] residual: 10.1 | iteration runtime: 0.445 ms, ( 0.637 s total)\n[Iteration 4 ] residual: 15.2 | iteration runtime: 0.494 ms, ( 0.637 s total)\n[Iteration 5 ] residual: 6.72 | iteration runtime: 0.432 ms, ( 0.638 s total)\n[Iteration 6 ] residual: 7.38 | iteration runtime: 0.508 ms, ( 0.638 s total)\n[Iteration 7 ] residual: 6.03 | iteration runtime: 0.495 ms, ( 0.639 s total)\n[Iteration 8 ] residual: 5.73 | iteration runtime: 0.585 ms, ( 0.639 s total)\n[Iteration 9 ] residual: 4.02 | iteration runtime: 0.463 ms, ( 0.64 s total)\n[Iteration 10 ] residual: 7.28 | iteration runtime: 0.576 ms, ( 0.64 s total)\n\n\n\n\n\n","category":"type"},{"location":"api/#Functions","page":"API Documentation","title":"Functions","text":"","category":"section"},{"location":"api/","page":"API Documentation","title":"API Documentation","text":"fit!\nmake_cache\nmake_numerical\ncompress_POMDP","category":"page"},{"location":"api/#CompressedBeliefMDPs.fit!","page":"API Documentation","title":"CompressedBeliefMDPs.fit!","text":"fit!(compressor::Compressor, beliefs)\n\nFit the compressor to beliefs.\n\n\n\n\n\n","category":"function"},{"location":"api/#CompressedBeliefMDPs.make_cache","page":"API Documentation","title":"CompressedBeliefMDPs.make_cache","text":"make_cache(B, B̃)\n\nHelper function that creates a cache that maps each unique belief from the set B to its corresponding compressed representation in B̃.\n\nArguments\n\nB::Vector{<:Any}: A vector of beliefs.\nB̃::Matrix{Float64}: A matrix where each row corresponds to the compressed representation of 
the beliefs in B.\n\nReturns\n\nDict{<:Any, Vector{Float64}}: A dictionary mapping each unique belief in B to its corresponding compressed representation in B̃.\n\nExample Usage\n\nB = [belief1, belief2, belief3]\nB̃ = [compressed_belief1; compressed_belief2; compressed_belief3]\nϕ = make_cache(B, B̃)\n\n\n\n\n\n","category":"function"},{"location":"api/#CompressedBeliefMDPs.make_numerical","page":"API Documentation","title":"CompressedBeliefMDPs.make_numerical","text":"make_numerical(B, pomdp)\n\nHelper function that converts a set of beliefs B into a numerical matrix representation suitable for processing by numerical algorithms/compressors.\n\nArguments\n\nB::Vector{<:Any}: A vector of beliefs.\npomdp::POMDP: The POMDP model associated with the beliefs.\n\nReturns\n\nMatrix{Float64}: A matrix where each row corresponds to a numerical representation of a belief in B.\n\nExample Usage\n\nB = [belief1, belief2, belief3]\nB_numerical = make_numerical(B, pomdp)\n\n\n\n\n\n","category":"function"},{"location":"api/#CompressedBeliefMDPs.compress_POMDP","page":"API Documentation","title":"CompressedBeliefMDPs.compress_POMDP","text":"compress_POMDP(pomdp, sampler, updater, compressor)\n\nCreates a compressed belief-state MDP by sampling, compressing, and caching beliefs from the given POMDP.\n\nArguments\n\npomdp::POMDP: The POMDP model to be compressed.\nsampler::Sampler: A sampler to generate a set of beliefs from the POMDP.\nupdater::Updater: An updater to initialize beliefs from states.\ncompressor::Compressor: A compressor to reduce the dimensionality of the beliefs.\n\nReturns\n\nCompressedBeliefMDP: The constructed compressed belief-state MDP.\nMatrix{Float64}: A matrix where each row corresponds to the compressed representation of the sampled beliefs.\n\nExample Usage\n\npomdp = TigerPOMDP()\nsampler = BeliefExpansionSampler(pomdp)\nupdater = DiscreteUpdater(pomdp)\ncompressor = PCACompressor(2)\nm, B̃ = compress_POMDP(pomdp, sampler, updater, 
compressor)\n\n\n\n\n\n","category":"function"},{"location":"compressors/#Compressors","page":"Compressors","title":"Compressors","text":"","category":"section"},{"location":"compressors/#Defining-a-Belief-Compressor","page":"Compressors","title":"Defining a Belief Compressor","text":"","category":"section"},{"location":"compressors/","page":"Compressors","title":"Compressors","text":"In this section, we outline the requirements and guidelines for defining a belief Compressor.","category":"page"},{"location":"compressors/#Interface","page":"Compressors","title":"Interface","text":"","category":"section"},{"location":"compressors/","page":"Compressors","title":"Compressors","text":"The Compressor interface is extremely minimal. It only supports two methods: fit! and the associated functor. For example, if you wanted to implement your own Compressor, you could write something like this","category":"page"},{"location":"compressors/","page":"Compressors","title":"Compressors","text":"struct MyCompressor <: Compressor\n foo\n bar\nend\n\n# functor definition\nfunction (c::MyCompressor)(beliefs)\n # YOUR CODE HERE\n return compressed_beliefs\nend\n\nfunction fit!(c::MyCompressor, beliefs)\n # YOUR CODE HERE\nend","category":"page"},{"location":"compressors/#Implementation-Tips","page":"Compressors","title":"Implementation Tips","text":"","category":"section"},{"location":"compressors/","page":"Compressors","title":"Compressors","text":"For robustness, both the functor and fit! should be able to handle AbstractVector and AbstractMatrix inputs. \nfit! is called only once after beliefs are sampled from the POMDP.\nCompressedBeliefSolver will attempt to convert each belief state (often of type DiscreteBelief) into an AbstractArray{Float64} using convert_s. As a convenience, CompressedBeliefMDP implements conversions for commonly used belief types; however, if the POMDP has a custom belief state, then it is the users' responsibility to implement the appropriate conversion. 
See the source code for help. ","category":"page"},{"location":"compressors/#Implemented-Compressors","page":"Compressors","title":"Implemented Compressors","text":"","category":"section"},{"location":"compressors/","page":"Compressors","title":"Compressors","text":"CompressedBeliefMDPs currently provides wrappers for the following compression types:","category":"page"},{"location":"compressors/","page":"Compressors","title":"Compressors","text":"a principal component analysis (PCA) compressor,\na kernel PCA compressor,\na probabilistic PCA compressor,\na factor analysis compressor,\nan isomap compressor,\nan autoencoder compressor,\na variational auto-encoder (VAE) compressor","category":"page"},{"location":"compressors/#Principal-Component-Analysis-(PCA)","page":"Compressors","title":"Principal Component Analysis (PCA)","text":"","category":"section"},{"location":"compressors/","page":"Compressors","title":"Compressors","text":"PCACompressor","category":"page"},{"location":"compressors/#CompressedBeliefMDPs.PCACompressor","page":"Compressors","title":"CompressedBeliefMDPs.PCACompressor","text":"Wrapper for MultivariateStats.PCA.\n\n\n\n\n\n","category":"function"},{"location":"compressors/#Kernel-PCA","page":"Compressors","title":"Kernel PCA","text":"","category":"section"},{"location":"compressors/","page":"Compressors","title":"Compressors","text":"KernelPCACompressor","category":"page"},{"location":"compressors/#CompressedBeliefMDPs.KernelPCACompressor","page":"Compressors","title":"CompressedBeliefMDPs.KernelPCACompressor","text":"Wrapper for MultivariateStats.KernelPCA.\n\n\n\n\n\n","category":"function"},{"location":"compressors/#Probabilistic-PCA","page":"Compressors","title":"Probabilistic 
PCA","text":"","category":"section"},{"location":"compressors/","page":"Compressors","title":"Compressors","text":"PPCACompressor","category":"page"},{"location":"compressors/#CompressedBeliefMDPs.PPCACompressor","page":"Compressors","title":"CompressedBeliefMDPs.PPCACompressor","text":"Wrapper for MultivariateStats.PPCA.\n\n\n\n\n\n","category":"function"},{"location":"compressors/#Factor-Analysis","page":"Compressors","title":"Factor Analysis","text":"","category":"section"},{"location":"compressors/","page":"Compressors","title":"Compressors","text":"FactorAnalysisCompressor","category":"page"},{"location":"compressors/#CompressedBeliefMDPs.FactorAnalysisCompressor","page":"Compressors","title":"CompressedBeliefMDPs.FactorAnalysisCompressor","text":"Wrapper for MultivariateStats.FactorAnalysis\n\n\n\n\n\n","category":"function"},{"location":"compressors/#Isomap","page":"Compressors","title":"Isomap","text":"","category":"section"},{"location":"compressors/","page":"Compressors","title":"Compressors","text":"IsomapCompressor","category":"page"},{"location":"compressors/#CompressedBeliefMDPs.IsomapCompressor","page":"Compressors","title":"CompressedBeliefMDPs.IsomapCompressor","text":"Wrapper for ManifoldLearning.Isomap.\n\n\n\n\n\n","category":"function"},{"location":"compressors/#Autoencoder","page":"Compressors","title":"Autoencoder","text":"","category":"section"},{"location":"compressors/","page":"Compressors","title":"Compressors","text":"AutoencoderCompressor","category":"page"},{"location":"compressors/#CompressedBeliefMDPs.AutoencoderCompressor","page":"Compressors","title":"CompressedBeliefMDPs.AutoencoderCompressor","text":"Implements an autoencoder in Flux.\n\n\n\n\n\n","category":"type"},{"location":"compressors/#Variational-Auto-Encoder-(VAE)","page":"Compressors","title":"Variational Auto-Encoder 
(VAE)","text":"","category":"section"},{"location":"compressors/","page":"Compressors","title":"Compressors","text":"VAECompressor","category":"page"},{"location":"compressors/#CompressedBeliefMDPs.VAECompressor","page":"Compressors","title":"CompressedBeliefMDPs.VAECompressor","text":"Implements a VAE in Flux.\n\n\n\n\n\n","category":"type"},{"location":"compressors/","page":"Compressors","title":"Compressors","text":"warning: Warning\nSome compression algorithms aren't optimized for large belief spaces. While they pass our unit tests, they may fail on large POMDPs or without seeding. For large POMDPs, users may want a custom Compressor.","category":"page"},{"location":"circular/#Circular-Maze","page":"Environments","title":"Circular Maze","text":"","category":"section"},{"location":"circular/#Description","page":"Environments","title":"Description","text":"","category":"section"},{"location":"circular/","page":"Environments","title":"Environments","text":"This environment is a generalization of the Circular Maze POMDP described in Finding Approximate POMDP solutions Through Belief Compression.[1] The world consists of n_corridors 1D circular corridors that each have corridor_length states. The robot spawns in a random corridor. It must determine which corridor it is in, navigate to the proper goal state, and finally declare that it has finished.","category":"page"},{"location":"circular/","page":"Environments","title":"Environments","text":"(Image: )","category":"page"},{"location":"circular/","page":"Environments","title":"Environments","text":"Figure from Finding Approximate POMDP solutions Through Belief Compression.","category":"page"},{"location":"circular/","page":"Environments","title":"Environments","text":"[1]: Roy doesn't actually name his toy environment. 
For the original environment details, see the \"PCA Performance\" subsection on page 8.","category":"page"},{"location":"circular/#Action-Space","page":"Environments","title":"Action Space","text":"","category":"section"},{"location":"circular/","page":"Environments","title":"Environments","text":"Transitions left and right are noisy and non-deterministic. Transition probabilities are from a discrete von Mises distribution with unit concentration and mean at the target state. ","category":"page"},{"location":"circular/","page":"Environments","title":"Environments","text":"Num Action Description\n1 CMAZE_LEFT Move left with von Mises noise.\n2 CMAZE_RIGHT Move right with von Mises noise.\n3 CMAZE_SENSE_CORRIDOR Observe the current corridor.\n4 CMAZE_DECLARE_GOAL Ends the episode. Receive r_findgoal if at the goal.","category":"page"},{"location":"circular/#State-Space","page":"Environments","title":"State Space","text":"","category":"section"},{"location":"circular/","page":"Environments","title":"Environments","text":"The (ordered) state space is an array of all CircularMazeStates and a terminalstate: [CircularMaze(1, 1), ..., CircularMaze(n_corridors, corridor_length), TerminalState()].","category":"page"},{"location":"circular/#Observation-Space","page":"Environments","title":"Observation Space","text":"","category":"section"},{"location":"circular/","page":"Environments","title":"Environments","text":"The observation space is the union of the state space and 1:n_corridors. If the robot picks CMAZE_SENSE_CORRIDOR, they observe the index of the current corridor. Otherwise, they observe their current state with von Mises noise.","category":"page"},{"location":"circular/#Rewards","page":"Environments","title":"Rewards","text":"","category":"section"},{"location":"circular/","page":"Environments","title":"Environments","text":"The goal is to navigate to the correct goal state for the given corridor and then to declare the goal once arrived. 
If the robot correctly declares the goal, it receives r_findgoal. It incurs r_timestep_penalty for every timestep it has not yet reached the goal. By default r_findgoal is 1 and r_timestep_penalty is 0. ","category":"page"},{"location":"circular/#Starting-State","page":"Environments","title":"Starting State","text":"","category":"section"},{"location":"circular/","page":"Environments","title":"Environments","text":"The initial state is sampled from repeated, discrete von Mises distributions, each with its mean at the center of a corridor. ","category":"page"},{"location":"circular/","page":"Environments","title":"Environments","text":"(Image: )","category":"page"},{"location":"circular/#Episode-End","page":"Environments","title":"Episode End","text":"","category":"section"},{"location":"circular/","page":"Environments","title":"Environments","text":"The episode terminates once the robot declares the goal (CMAZE_DECLARE_GOAL), regardless of whether the robot is correct.","category":"page"},{"location":"circular/#Documentation","page":"Environments","title":"Documentation","text":"","category":"section"},{"location":"circular/","page":"Environments","title":"Environments","text":"CircularMaze","category":"page"},{"location":"circular/#CompressedBeliefMDPs.CircularMaze","page":"Environments","title":"CompressedBeliefMDPs.CircularMaze","text":"CircularMaze(n_corridors::Integer, corridor_length::Integer, discount::Float64, r_findgoal::Float64, r_timestep_penalty::Float64)\nCircularMaze(n_corridors::Integer, corridor_length::Integer; kwargs...)\nCircularMaze()\n\nA POMDP representing a circular maze environment.\n\nFields\n\nn_corridors::Integer: Number of corridors in the circular maze.\ncorridor_length::Integer: Length of each corridor.\nprobabilities::AbstractArray: Probability masses for creating von Mises distributions.\ncenter::Integer: The central position in the maze.\ndiscount::Float64: Discount factor for future rewards.\nr_findgoal::Float64: Reward for 
finding the goal.\nr_timestep_penalty::Float64: Penalty for each timestep taken.\nstates::AbstractArray: Array of all possible states in the maze.\ngoals::AbstractArray: Array of goal states in the maze.\n\nExample\n\nusing CompressedBeliefMDPs\n\nn_corridors = 8\ncorridor_length = 25\nmaze = CircularMaze(n_corridors, corridor_length)\n\n\n\n\n\n","category":"type"},{"location":"circular/","page":"Environments","title":"Environments","text":"CircularMazeState","category":"page"},{"location":"circular/#CompressedBeliefMDPs.CircularMazeState","page":"Environments","title":"CompressedBeliefMDPs.CircularMazeState","text":"CircularMazeState(corridor::Integer, x::Integer)\n\nThe CircularMazeState struct represents the state of an agent in a circular maze.\n\nFields\n\ncorridor::Integer: The corridor number. The value ranges from 1 to n_corridors.\nx::Integer: The position of the state within the corridor. The value ranges from 1 to the corridor_length.\n\n\n\n\n\n","category":"type"},{"location":"#CompressedBeliefMDPs.jl","page":"CompressedBeliefMDPs.jl","title":"CompressedBeliefMDPs.jl","text":"","category":"section"},{"location":"#Introduction","page":"CompressedBeliefMDPs.jl","title":"Introduction","text":"","category":"section"},{"location":"","page":"CompressedBeliefMDPs.jl","title":"CompressedBeliefMDPs.jl","text":"Welcome to CompressedBeliefMDPs.jl! This package is part of the POMDPs.jl ecosystem and takes inspiration from Exponential Family PCA for Belief Compression in POMDPs. 
","category":"page"},{"location":"","page":"CompressedBeliefMDPs.jl","title":"CompressedBeliefMDPs.jl","text":"This package provides a general framework for applying belief compression in large POMDPs with generic compression, sampling, and planning algorithms.","category":"page"},{"location":"#Installation","page":"CompressedBeliefMDPs.jl","title":"Installation","text":"","category":"section"},{"location":"","page":"CompressedBeliefMDPs.jl","title":"CompressedBeliefMDPs.jl","text":"You can install CompressedBeliefMDPs.jl using Julia's package manager. Open the Julia REPL (press ] to enter the package manager mode) and run the following command:","category":"page"},{"location":"","page":"CompressedBeliefMDPs.jl","title":"CompressedBeliefMDPs.jl","text":"pkg> add CompressedBeliefMDPs","category":"page"},{"location":"#Quickstart","page":"CompressedBeliefMDPs.jl","title":"Quickstart","text":"","category":"section"},{"location":"","page":"CompressedBeliefMDPs.jl","title":"CompressedBeliefMDPs.jl","text":"Using belief compression is easy. 
Simply pick a Sampler, Compressor, and a base Policy and then use the standard POMDPs.jl interface.","category":"page"},{"location":"","page":"CompressedBeliefMDPs.jl","title":"CompressedBeliefMDPs.jl","text":"using POMDPs, POMDPTools, POMDPModels\nusing CompressedBeliefMDPs\n\npomdp = BabyPOMDP()\ncompressor = PCACompressor(1)\nupdater = DiscreteUpdater(pomdp)\nsampler = BeliefExpansionSampler(pomdp)\nsolver = CompressedBeliefSolver(\n pomdp;\n compressor=compressor,\n sampler=sampler,\n updater=updater,\n verbose=true, \n max_iterations=100, \n n_generative_samples=50, \n k=2\n)\npolicy = solve(solver, pomdp)","category":"page"},{"location":"#Continuous-Example","page":"CompressedBeliefMDPs.jl","title":"Continuous Example","text":"","category":"section"},{"location":"","page":"CompressedBeliefMDPs.jl","title":"CompressedBeliefMDPs.jl","text":"This example demonstrates using CompressedBeliefMDP in a continuous setting with the LightDark1D POMDP. It combines particle filters for belief updating and Monte Carlo Tree Search (MCTS) as the solver. 
While compressing a 1D space is a trivial toy problem, this architecture can be easily scaled to larger POMDPs with continuous state and action spaces.","category":"page"},{"location":"","page":"CompressedBeliefMDPs.jl","title":"CompressedBeliefMDPs.jl","text":"using POMDPs, POMDPModels, POMDPTools\nusing ParticleFilters\nusing MCTS\nusing CompressedBeliefMDPs\n\npomdp = LightDark1D()\npomdp.movement_cost = 1\nbase_solver = MCTSSolver(n_iterations=10, depth=50, exploration_constant=5.0)\nupdater = BootstrapFilter(pomdp, 100)\nsolver = CompressedBeliefSolver(\n pomdp,\n base_solver;\n updater=updater,\n sampler=PolicySampler(pomdp; updater=updater)\n)\npolicy = solve(solver, pomdp)\nrs = RolloutSimulator(max_steps=50)\nr = simulate(rs, pomdp, policy)","category":"page"},{"location":"","page":"CompressedBeliefMDPs.jl","title":"CompressedBeliefMDPs.jl","text":"Note: We use MCTS here as a proof of concept that CompressedBeliefMDPs can handle continuous state and action spaces. In reality, belief compression has no effect on MCTS with double progressive widening. If you want to solve continuous POMDPs, we suggest implementing a custom solver or looking into Crux.jl.","category":"page"},{"location":"#Large-Example","page":"CompressedBeliefMDPs.jl","title":"Large Example","text":"","category":"section"},{"location":"","page":"CompressedBeliefMDPs.jl","title":"CompressedBeliefMDPs.jl","text":"In this example, we tackle a more realistic scenario with the TMaze POMDP, which has 123 states. To handle the larger state space efficiently, we employ a variational auto-encoder (VAE) to compress the belief simplex. 
By leveraging the VAE's ability to learn a compact representation of the belief state, we focus computational power on the relevant compressed belief states during each Bellman update.","category":"page"},{"location":"","page":"CompressedBeliefMDPs.jl","title":"CompressedBeliefMDPs.jl","text":"using POMDPs, POMDPModels, POMDPTools\nusing CompressedBeliefMDPs\n\npomdp = TMaze(60, 0.9)\nsolver = CompressedBeliefSolver(\n pomdp;\n compressor=VAECompressor(123, 6; hidden_dim=10, verbose=true, epochs=2),\n sampler=PolicySampler(pomdp, n=500),\n verbose=true, \n max_iterations=1000, \n n_generative_samples=30,\n k=2\n)\npolicy = solve(solver, pomdp)\nrs = RolloutSimulator(max_steps=50)\nr = simulate(rs, pomdp, policy)","category":"page"},{"location":"#Concepts-and-Architecture","page":"CompressedBeliefMDPs.jl","title":"Concepts and Architecture","text":"","category":"section"},{"location":"","page":"CompressedBeliefMDPs.jl","title":"CompressedBeliefMDPs.jl","text":"CompressedBeliefMDPs.jl aims to implement a generalization of the belief compression algorithm for solving large POMDPs. 
The algorithm has four steps:","category":"page"},{"location":"","page":"CompressedBeliefMDPs.jl","title":"CompressedBeliefMDPs.jl","text":"collect belief samples,\ncompress the samples,\ncreate the compressed belief-state MDP,\nsolve the MDP.","category":"page"},{"location":"","page":"CompressedBeliefMDPs.jl","title":"CompressedBeliefMDPs.jl","text":"Each step is handled by Sampler, Compressor, CompressedBeliefMDP, and CompressedBeliefSolver respectively.","category":"page"},{"location":"","page":"CompressedBeliefMDPs.jl","title":"CompressedBeliefMDPs.jl","text":"For more details, please see the rest of the documentation or the associated paper.","category":"page"},{"location":"samplers/#Samplers","page":"Samplers","title":"Samplers","text":"","category":"section"},{"location":"samplers/#Defining-a-Sampler","page":"Samplers","title":"Defining a Sampler","text":"","category":"section"},{"location":"samplers/","page":"Samplers","title":"Samplers","text":"In this section, we outline the requirements and guidelines for defining a belief Sampler.","category":"page"},{"location":"samplers/#Interface","page":"Samplers","title":"Interface","text":"","category":"section"},{"location":"samplers/","page":"Samplers","title":"Samplers","text":"The Sampler interface only has one method: the functor. 
For example, if you wanted to implement your own Sampler, you could write something like this","category":"page"},{"location":"samplers/","page":"Samplers","title":"Samplers","text":"struct MySampler <: Sampler\n foo\n bar\nend\n\n# functor definition\nfunction (c::MySampler)(pomdp::POMDP)\n # YOUR CODE HERE\n return sampled_beliefs\nend","category":"page"},{"location":"samplers/#Implemented-Sampler","page":"Samplers","title":"Implemented Sampler","text":"","category":"section"},{"location":"samplers/","page":"Samplers","title":"Samplers","text":"CompressedBeliefMDPs provides the following generic belief samplers:","category":"page"},{"location":"samplers/","page":"Samplers","title":"Samplers","text":"an exploratory belief expansion sampler\na Policy rollout sampler\nan ExplorationPolicy rollout sampler","category":"page"},{"location":"samplers/#Exploratory-Belief-Expansion","page":"Samplers","title":"Exploratory Belief Expansion","text":"","category":"section"},{"location":"samplers/","page":"Samplers","title":"Samplers","text":"BeliefExpansionSampler","category":"page"},{"location":"samplers/#CompressedBeliefMDPs.BeliefExpansionSampler","page":"Samplers","title":"CompressedBeliefMDPs.BeliefExpansionSampler","text":"BeliefExpansionSampler\n\nFast extension of exploratory belief expansion (Algorithm 21.13 in Algorithms for Decision Making) that uses k-d trees.\n\nFields\n\nupdater::Updater: The updater used to update beliefs.\nmetric::NearestNeighbors.MinkowskiMetric: The metric used to measure distances between beliefs. It must be a Minkowski metric.\nn::Integer: The number of belief expansions to perform.\n\nConstructors\n\nBeliefExpansionSampler(pomdp::POMDP; updater::Updater=DiscreteUpdater(pomdp),\nmetric::NearestNeighbors.MinkowskiMetric=Euclidean(), n::Integer=3)\n\nMethods\n\n(s::BeliefExpansionSampler)(pomdp::POMDP)\n\nCreates an initial belief and performs exploratory belief expansion. Returns the unique belief states. 
Only works for POMDPs with discrete state, action, and observation spaces.\n\nExample Usage\n\njulia> pomdp = TigerPOMDP();\njulia> sampler = BeliefExpansionSampler(pomdp; n=2);\njulia> beliefs = sampler(pomdp)\nSet{DiscreteBelief{TigerPOMDP, Bool}} with 4 elements:\n DiscreteBelief{TigerPOMDP, Bool}(TigerPOMDP(-1.0, -100.0, 10.0, 0.85, 0.95), Bool[0, 1], [0.15000000000000002, 0.85])\n DiscreteBelief{TigerPOMDP, Bool}(TigerPOMDP(-1.0, -100.0, 10.0, 0.85, 0.95), Bool[0, 1], [0.5, 0.5])\n DiscreteBelief{TigerPOMDP, Bool}(TigerPOMDP(-1.0, -100.0, 10.0, 0.85, 0.95), Bool[0, 1], [0.85, 0.15000000000000002])\n DiscreteBelief{TigerPOMDP, Bool}(TigerPOMDP(-1.0, -100.0, 10.0, 0.85, 0.95), Bool[0, 1], [0.9697986577181208, 0.030201342281879207])\n\n\n\n\n\n","category":"type"},{"location":"samplers/#Policy-Sampler","page":"Samplers","title":"Policy Sampler","text":"","category":"section"},{"location":"samplers/","page":"Samplers","title":"Samplers","text":"PolicySampler","category":"page"},{"location":"samplers/#CompressedBeliefMDPs.PolicySampler","page":"Samplers","title":"CompressedBeliefMDPs.PolicySampler","text":"PolicySampler\n\nSamples belief states by rolling out a Policy.\n\nFields\n\npolicy::Policy: The policy used for decision making.\nupdater::Updater: The updater used for updating beliefs.\nn::Integer: The maximum number of simulated steps.\nrng::AbstractRNG: The random number generator used for sampling.\nverbose::Bool: Whether to use a progress bar while sampling.\n\nConstructors\n\nPolicySampler(pomdp::POMDP; policy::Policy=RandomPolicy(pomdp), \nupdater::Updater=DiscreteUpdater(pomdp), n::Integer=10, \nrng::AbstractRNG=Random.GLOBAL_RNG)\n\nMethods\n\n(s::PolicySampler)(pomdp::POMDP)\n\nReturns a vector of unique belief states.\n\nExample\n\njulia> pomdp = TigerPOMDP();\njulia> sampler = PolicySampler(pomdp; n=3);\njulia> sampler(pomdp)\n2-element Vector{Any}:\nDiscreteBelief{TigerPOMDP, Bool}(TigerPOMDP(-1.0, -100.0, 10.0, 0.85, 0.95), Bool[0, 1], [0.5, 
0.5])\nDiscreteBelief{TigerPOMDP, Bool}(TigerPOMDP(-1.0, -100.0, 10.0, 0.85, 0.95), Bool[0, 1], [0.15000000000000002, 0.85])\n\n\n\n\n\n","category":"type"},{"location":"samplers/#ExplorationPolicy-Sampler","page":"Samplers","title":"ExplorationPolicy Sampler","text":"","category":"section"},{"location":"samplers/","page":"Samplers","title":"Samplers","text":"ExplorationPolicySampler","category":"page"},{"location":"samplers/#CompressedBeliefMDPs.ExplorationPolicySampler","page":"Samplers","title":"CompressedBeliefMDPs.ExplorationPolicySampler","text":"ExplorationPolicySampler\n\nSamples belief states by rolling out an ExplorationPolicy. Essentially identical to PolicySampler.\n\nFields\n\nexplorer::ExplorationPolicy: The ExplorationPolicy used for decision making.\non_policy::Policy: The fallback Policy used for decision making when not exploring.\nupdater::Updater: The updater used for updating beliefs.\nn::Integer: The maximum number of simulated steps.\nrng::AbstractRNG: The random number generator used for sampling.\nverbose::Bool: Whether to use a progress bar while sampling.\n\nConstructors\n\nExplorationPolicySampler(pomdp::POMDP; rng::AbstractRNG=Random.GLOBAL_RNG,\nexplorer::ExplorationPolicy=EpsGreedyPolicy(pomdp, 0.1; rng=rng), on_policy=RandomPolicy(pomdp),\nupdater::Updater=DiscreteUpdater(pomdp), n::Integer=10)\n\nMethods\n\n(s::ExplorationPolicySampler)(pomdp::POMDP)\n\nReturns a vector of unique belief states.\n\nExample Usage\n\njulia> pomdp = TigerPOMDP()\njulia> sampler = ExplorationPolicySampler(pomdp; n=30)\njulia> sampler(pomdp)\n3-element Vector{Any}:\n DiscreteBelief{TigerPOMDP, Bool}(TigerPOMDP(-1.0, -100.0, 10.0, 0.85, 0.95), Bool[0, 1], [0.5, 0.5])\n DiscreteBelief{TigerPOMDP, Bool}(TigerPOMDP(-1.0, -100.0, 10.0, 0.85, 0.95), Bool[0, 1], [0.85, 0.15000000000000002])\n DiscreteBelief{TigerPOMDP, Bool}(TigerPOMDP(-1.0, -100.0, 10.0, 0.85, 0.95), Bool[0, 1], [0.9697986577181208, 0.030201342281879207])\n\n\n\n\n\n","category":"type"}] }