Look at OFVChallenger #457

kongzii · 2024-09-09T12:02:30Z

After a few days:

resolve some markets manually to check his accuracy
verify it's claiming bonds back
check his losses
???

kongzii · 2024-09-12T10:19:52Z

resolve some markets manually to check his accuracy

I manually annotated 40 questions in Langfuse:

2 are resolved incorrectly, so we have 95% accuracy
7 are resolved differently from Olas' resolver, from these, 5 are correct
- so Olas' resolver has 87.5% accuracy

Unfortunately, no human challenged these 2 wrong answers. I caught the second one just in time and corrected it by myself, but the first one is now finalized wrongly.

I will check another batch of questions again next week.

kongzii · 2024-09-12T10:22:41Z

verify it's claiming bonds back
check his losses

The agent is getting its xDai back, for example, this transaction https://gnosisscan.io/tx/0x0a8f6a00388be5479600cb2503fb6dabef77c78eb1483c9c3bdda9855a2cb67a.

Just that it loses 0.001 xDai every time it posts the same answer as already posted.

kongzii added the high priority label Sep 9, 2024

kongzii added this to the Reality Resolutions to >90% Accuracy milestone Sep 9, 2024

kongzii self-assigned this Sep 12, 2024

This was referenced Sep 19, 2024

Improve resolution accuracy on Omen gnosis/prediction-market-agent-tooling#319

Closed

Create script that will automatically measure accuracy of OFVChallenger based on Reality challenges #483

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Look at OFVChallenger #457

Look at OFVChallenger #457

kongzii commented Sep 9, 2024

kongzii commented Sep 12, 2024 •

edited

Loading

kongzii commented Sep 12, 2024 •

edited

Loading

Look at OFVChallenger #457

Look at OFVChallenger #457

Comments

kongzii commented Sep 9, 2024

kongzii commented Sep 12, 2024 • edited Loading

kongzii commented Sep 12, 2024 • edited Loading

kongzii commented Sep 12, 2024 •

edited

Loading

kongzii commented Sep 12, 2024 •

edited

Loading