You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: dev/benchmark_interfaces/index.html
+1-1Lines changed: 1 addition & 1 deletion
Original file line number
Diff line number
Diff line change
@@ -39,4 +39,4 @@
39
39
reset!(env::AbstractEnvironment; reset_rng::Bool, seed=get_seed(env)) # Reset to initial state
40
40
observe(env::AbstractEnvironment) -> (obs, info) # Get current observation
41
41
step!(env::AbstractEnvironment, action) -> reward # Take action, get reward
42
-
is_terminated(env::AbstractEnvironment) -> Bool # Check if episode ended</code></pre></article><navclass="docs-footer"><aclass="docs-footer-prevpage" href="../">« Getting started</a><aclass="docs-footer-nextpage" href="../warcraft_tutorial/">Path-finding on image maps »</a><divclass="flexbox-break"></div><pclass="footer-message">Powered by <ahref="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> and the <ahref="https://julialang.org/">Julia Programming Language</a>.</p></nav></div><divclass="modal" id="documenter-settings"><divclass="modal-background"></div><divclass="modal-card"><headerclass="modal-card-head"><pclass="modal-card-title">Settings</p><buttonclass="delete"></button></header><sectionclass="modal-card-body"><p><labelclass="label">Theme</label><divclass="select"><selectid="documenter-themepicker"><optionvalue="auto">Automatic (OS)</option><optionvalue="documenter-light">documenter-light</option><optionvalue="documenter-dark">documenter-dark</option><optionvalue="catppuccin-latte">catppuccin-latte</option><optionvalue="catppuccin-frappe">catppuccin-frappe</option><optionvalue="catppuccin-macchiato">catppuccin-macchiato</option><optionvalue="catppuccin-mocha">catppuccin-mocha</option></select></div></p><hr/><p>This document was generated with <ahref="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> version 1.16.1 on <spanclass="colophon-date" title="Monday 12 January 2026 08:15">Monday 12 January 2026</span>. Using Julia version 1.12.4.</p></section><footerclass="modal-card-foot"></footer></div></div></div></body></html>
42
+
is_terminated(env::AbstractEnvironment) -> Bool # Check if episode ended</code></pre></article><navclass="docs-footer"><aclass="docs-footer-prevpage" href="../">« Getting started</a><aclass="docs-footer-nextpage" href="../warcraft_tutorial/">Path-finding on image maps »</a><divclass="flexbox-break"></div><pclass="footer-message">Powered by <ahref="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> and the <ahref="https://julialang.org/">Julia Programming Language</a>.</p></nav></div><divclass="modal" id="documenter-settings"><divclass="modal-background"></div><divclass="modal-card"><headerclass="modal-card-head"><pclass="modal-card-title">Settings</p><buttonclass="delete"></button></header><sectionclass="modal-card-body"><p><labelclass="label">Theme</label><divclass="select"><selectid="documenter-themepicker"><optionvalue="auto">Automatic (OS)</option><optionvalue="documenter-light">documenter-light</option><optionvalue="documenter-dark">documenter-dark</option><optionvalue="catppuccin-latte">catppuccin-latte</option><optionvalue="catppuccin-frappe">catppuccin-frappe</option><optionvalue="catppuccin-macchiato">catppuccin-macchiato</option><optionvalue="catppuccin-mocha">catppuccin-mocha</option></select></div></p><hr/><p>This document was generated with <ahref="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> version 1.17.0 on <spanclass="colophon-date" title="Tuesday 3 March 2026 14:55">Tuesday 3 March 2026</span>. Using Julia version 1.12.5.</p></section><footerclass="modal-card-foot"></footer></div></div></div></body></html>
Copy file name to clipboardExpand all lines: dev/benchmarks/dvsp/index.html
+1-1Lines changed: 1 addition & 1 deletion
Original file line number
Diff line number
Diff line change
@@ -3,4 +3,4 @@
3
3
\fbox{Neural network $\varphi_w$}
4
4
\xrightarrow[\text{Prizes}]{\theta}
5
5
\fbox{Prize-collecting VSP}
6
-
\xrightarrow[\text{Routes}]{a_t}\]</p><p><strong>Components</strong>:</p><ol><li><strong>Neural Network</strong><span>$\varphi_w$</span>: Takes current state features as input and predicts customer prizes <span>$\theta = (\theta_1, \ldots, \theta_n)$</span>, one value per postponable customer.</li><li><strong>Optimization Layer</strong>: Solves the prize-collecting vehicle scheduling problem to determine optimal routes given the predicted prizes, by maximizing total collected prizes minus travel costs: <code>math \max_{a_t\in \mathcal{A}(s_t)} \sum_{r \in a_t} \left( \sum_{i \in r} \theta_i - \sum_{(i,j) \in r} d_{ij} \right)</code> This can be modeled as a flow linear program on a directed acyclic graph (DAG) and is solved using standard LP solvers.</li></ol><p>The neural network architecture adapts to the feature dimensionality:</p><ul><li><strong>2D features</strong>: <code>Dense(2 => 1)</code>, applied in parallel to each postponable customer</li><li><strong>Full features</strong>: <code>Dense(27 => 1)</code> applied in parallel to each postponable customer</li></ul><p><strong>Note:</strong> one can also use more complex architectures such as a deeper MLP or a graph neural network for better performance.</p></article><navclass="docs-footer"><aclass="docs-footer-prevpage" href="../argmax/">« Argmax</a><aclass="docs-footer-nextpage" href="../dynamic_assortment/">Dynamic Assortment »</a><divclass="flexbox-break"></div><pclass="footer-message">Powered by <ahref="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> and the <ahref="https://julialang.org/">Julia Programming Language</a>.</p></nav></div><divclass="modal" id="documenter-settings"><divclass="modal-background"></div><divclass="modal-card"><headerclass="modal-card-head"><pclass="modal-card-title">Settings</p><buttonclass="delete"></button></header><sectionclass="modal-card-body"><p><labelclass="label">Theme</label><divclass="select"><selectid="documenter-themepicker"><optionvalue="auto">Automatic (OS)</option><optionvalue="documenter-light">documenter-light</option><optionvalue="documenter-dark">documenter-dark</option><optionvalue="catppuccin-latte">catppuccin-latte</option><optionvalue="catppuccin-frappe">catppuccin-frappe</option><optionvalue="catppuccin-macchiato">catppuccin-macchiato</option><optionvalue="catppuccin-mocha">catppuccin-mocha</option></select></div></p><hr/><p>This document was generated with <ahref="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> version 1.16.1 on <spanclass="colophon-date" title="Monday 12 January 2026 08:15">Monday 12 January 2026</span>. Using Julia version 1.12.4.</p></section><footerclass="modal-card-foot"></footer></div></div></div></body></html>
6
+
\xrightarrow[\text{Routes}]{a_t}\]</p><p><strong>Components</strong>:</p><ol><li><strong>Neural Network</strong><span>$\varphi_w$</span>: Takes current state features as input and predicts customer prizes <span>$\theta = (\theta_1, \ldots, \theta_n)$</span>, one value per postponable customer.</li><li><strong>Optimization Layer</strong>: Solves the prize-collecting vehicle scheduling problem to determine optimal routes given the predicted prizes, by maximizing total collected prizes minus travel costs: <code>math \max_{a_t\in \mathcal{A}(s_t)} \sum_{r \in a_t} \left( \sum_{i \in r} \theta_i - \sum_{(i,j) \in r} d_{ij} \right)</code> This can be modeled as a flow linear program on a directed acyclic graph (DAG) and is solved using standard LP solvers.</li></ol><p>The neural network architecture adapts to the feature dimensionality:</p><ul><li><strong>2D features</strong>: <code>Dense(2 => 1)</code>, applied in parallel to each postponable customer</li><li><strong>Full features</strong>: <code>Dense(27 => 1)</code> applied in parallel to each postponable customer</li></ul><p><strong>Note:</strong> one can also use more complex architectures such as a deeper MLP or a graph neural network for better performance.</p></article><navclass="docs-footer"><aclass="docs-footer-prevpage" href="../argmax/">« Argmax</a><aclass="docs-footer-nextpage" href="../dynamic_assortment/">Dynamic Assortment »</a><divclass="flexbox-break"></div><pclass="footer-message">Powered by <ahref="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> and the <ahref="https://julialang.org/">Julia Programming Language</a>.</p></nav></div><divclass="modal" id="documenter-settings"><divclass="modal-background"></div><divclass="modal-card"><headerclass="modal-card-head"><pclass="modal-card-title">Settings</p><buttonclass="delete"></button></header><sectionclass="modal-card-body"><p><labelclass="label">Theme</label><divclass="select"><selectid="documenter-themepicker"><optionvalue="auto">Automatic (OS)</option><optionvalue="documenter-light">documenter-light</option><optionvalue="documenter-dark">documenter-dark</option><optionvalue="catppuccin-latte">catppuccin-latte</option><optionvalue="catppuccin-frappe">catppuccin-frappe</option><optionvalue="catppuccin-macchiato">catppuccin-macchiato</option><optionvalue="catppuccin-mocha">catppuccin-mocha</option></select></div></p><hr/><p>This document was generated with <ahref="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> version 1.17.0 on <spanclass="colophon-date" title="Tuesday 3 March 2026 14:55">Tuesday 3 March 2026</span>. Using Julia version 1.12.5.</p></section><footerclass="modal-card-foot"></footer></div></div></div></body></html>
Copy file name to clipboardExpand all lines: dev/benchmarks/dynamic_assortment/index.html
+1-1Lines changed: 1 addition & 1 deletion
Original file line number
Diff line number
Diff line change
@@ -9,4 +9,4 @@
9
9
\fbox{Neural network $\varphi_w$}
10
10
\xrightarrow[\text{Cost vector}]{\theta}
11
11
\fbox{Top K}
12
-
\xrightarrow[\text{Assortment}]{a_t}\]</p><p><strong>Components</strong>:</p><ol><li><strong>Neural Network</strong><span>$\varphi_w$</span>: Takes the current state <span>$s_t$</span> as input and predicts item utilities <span>$\theta = (\theta_1, \ldots, \theta_N)$</span></li><li><strong>Optimization Layer</strong>: Selects the top <span>$K$</span> items with highest predicted utilities to form the assortment <span>$a_t$</span></li></ol><h2id="Reference"><aclass="docs-heading-anchor" href="#Reference">Reference</a><aid="Reference-1"></a><aclass="docs-heading-anchor-permalink" href="#Reference" title="Permalink"></a></h2><p>Based on the paper: <ahref="https://arxiv.org/abs/2505.19053">Structured Reinforcement Learning for Combinatorial Decision-Making</a></p></article><navclass="docs-footer"><aclass="docs-footer-prevpage" href="../dvsp/">« Dynamic Vehicle Scheduling</a><aclass="docs-footer-nextpage" href="../fixed_size_shortest_path/">Shortest paths »</a><divclass="flexbox-break"></div><pclass="footer-message">Powered by <ahref="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> and the <ahref="https://julialang.org/">Julia Programming Language</a>.</p></nav></div><divclass="modal" id="documenter-settings"><divclass="modal-background"></div><divclass="modal-card"><headerclass="modal-card-head"><pclass="modal-card-title">Settings</p><buttonclass="delete"></button></header><sectionclass="modal-card-body"><p><labelclass="label">Theme</label><divclass="select"><selectid="documenter-themepicker"><optionvalue="auto">Automatic (OS)</option><optionvalue="documenter-light">documenter-light</option><optionvalue="documenter-dark">documenter-dark</option><optionvalue="catppuccin-latte">catppuccin-latte</option><optionvalue="catppuccin-frappe">catppuccin-frappe</option><optionvalue="catppuccin-macchiato">catppuccin-macchiato</option><optionvalue="catppuccin-mocha">catppuccin-mocha</option></select></div></p><hr/><p>This document was generated with <ahref="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> version 1.16.1 on <spanclass="colophon-date" title="Monday 12 January 2026 08:15">Monday 12 January 2026</span>. Using Julia version 1.12.4.</p></section><footerclass="modal-card-foot"></footer></div></div></div></body></html>
12
+
\xrightarrow[\text{Assortment}]{a_t}\]</p><p><strong>Components</strong>:</p><ol><li><strong>Neural Network</strong><span>$\varphi_w$</span>: Takes the current state <span>$s_t$</span> as input and predicts item utilities <span>$\theta = (\theta_1, \ldots, \theta_N)$</span></li><li><strong>Optimization Layer</strong>: Selects the top <span>$K$</span> items with highest predicted utilities to form the assortment <span>$a_t$</span></li></ol><h2id="Reference"><aclass="docs-heading-anchor" href="#Reference">Reference</a><aid="Reference-1"></a><aclass="docs-heading-anchor-permalink" href="#Reference" title="Permalink"></a></h2><p>Based on the paper: <ahref="https://arxiv.org/abs/2505.19053">Structured Reinforcement Learning for Combinatorial Decision-Making</a></p></article><navclass="docs-footer"><aclass="docs-footer-prevpage" href="../dvsp/">« Dynamic Vehicle Scheduling</a><aclass="docs-footer-nextpage" href="../fixed_size_shortest_path/">Shortest paths »</a><divclass="flexbox-break"></div><pclass="footer-message">Powered by <ahref="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> and the <ahref="https://julialang.org/">Julia Programming Language</a>.</p></nav></div><divclass="modal" id="documenter-settings"><divclass="modal-background"></div><divclass="modal-card"><headerclass="modal-card-head"><pclass="modal-card-title">Settings</p><buttonclass="delete"></button></header><sectionclass="modal-card-body"><p><labelclass="label">Theme</label><divclass="select"><selectid="documenter-themepicker"><optionvalue="auto">Automatic (OS)</option><optionvalue="documenter-light">documenter-light</option><optionvalue="documenter-dark">documenter-dark</option><optionvalue="catppuccin-latte">catppuccin-latte</option><optionvalue="catppuccin-frappe">catppuccin-frappe</option><optionvalue="catppuccin-macchiato">catppuccin-macchiato</option><optionvalue="catppuccin-mocha">catppuccin-mocha</option></select></div></p><hr/><p>This document was generated with <ahref="https://github.com/JuliaDocs/Documenter.jl">Documenter.jl</a> version 1.17.0 on <spanclass="colophon-date" title="Tuesday 3 March 2026 14:55">Tuesday 3 March 2026</span>. Using Julia version 1.12.5.</p></section><footerclass="modal-card-foot"></footer></div></div></div></body></html>
0 commit comments