Skip to content

Commit

Permalink
added videos
Browse files Browse the repository at this point in the history
  • Loading branch information
Jason Ma committed Oct 20, 2023
1 parent e0c1c13 commit 1876a21
Show file tree
Hide file tree
Showing 3 changed files with 23 additions and 3 deletions.
26 changes: 23 additions & 3 deletions index.html
Original file line number Diff line number Diff line change
Expand Up @@ -451,6 +451,20 @@ <h2 class="title is-3"><span class="dvima">Eureka Components</span></h2>
<div class="rows">
<div class="rows is-centered ">
<div class="row is-full-width">

<div class="columns is-centered has-text-centered">
<video poster="" id="" autoplay controls muted loop width="70%" height="100%" playbackRate=2.0 style="border-radius: 5px;">
<source src="videos/eureka_env_context.mp4" type="video/mp4">
</video>
</div>
<span style="font-size: 125%">
<span style="font-weight: bold">Environment as Context.</span>
By using the raw environment code as context, Eureka can zero-shot generate plausible reward programs, without any task-specific prompt engineering. This allows Eureka to be a <b>generalist</b> reward designer,
readily producing reward functions on first try for all our environments. </span>
<br>
<br>
<br>

<div class="columns is-centered has-text-centered">
<video poster="" id="" autoplay controls muted loop width="70%" height="100%" playbackRate=2.0 style="border-radius: 5px;">
<source src="videos/shadow_hand_multi.mp4" type="video/mp4">
Expand All @@ -464,8 +478,14 @@ <h2 class="title is-3"><span class="dvima">Eureka Components</span></h2>
<br>
<br>
<br>
<img src="assets/images/reward_diff.png" class="interpolation-image" alt=""
style="display: block; margin-left: auto; margin-right: auto; max-width: 80%;" />

<div class="columns is-centered has-text-centered">
<video poster="" id="" autoplay controls muted loop width="70%" height="100%" playbackRate=2.0 style="border-radius: 5px;">
<source src="videos/eureka_reward_reflection.mp4" type="video/mp4">
</video>
</div>
<!-- <img src="assets/images/reward_diff.png" class="interpolation-image" alt="" -->
<!-- style="display: block; margin-left: auto; margin-right: auto; max-width: 80%;" /> -->
<span style="font-size: 125%">
<span style="font-weight: bold">Eureka Reward Reflection.</span> After reward evaluation,
Eureka constructs reward reflection that summarizes the key statistics of the RL training. Then, Eureka uses this reward reflection
Expand Down Expand Up @@ -598,7 +618,7 @@ <h3 class="title is-3"><span class="dvima">Eureka from Human Feedback</span></h3
<br>

<img src="assets/images/bidex_reward_assistant.png" class="interpolation-image" alt=""
style="display: block; width: 80%; margin-left: auto; margin-right: auto" />
style="display: block; width: 70%; margin-left: auto; margin-right: auto" />
<br>
<span style="font-size: 125%">
<span style="font-weight: bold">Eureka effectively improves and benefits from human reward initialization.</span>
Expand Down
Binary file added videos/eureka_env_context.mp4
Binary file not shown.
Binary file added videos/eureka_reward_reflection.mp4
Binary file not shown.

0 comments on commit 1876a21

Please sign in to comment.