Releases: embeddings-benchmark/mteb

1.19.4

11 Nov 09:18

KennethEnevoldsen

1.19.4 Latest

1.19.4 (2024-11-11)

Fix

fix: Add missing benchmarks in benchmarks.py (#1431)

Fixes #1423 (a240ea0)

fix: Add Korean AutoRAGRetrieval (#1388)
feat: add AutoRAG Korean embedding retrieval benchmark
fix: run linters (ruff format . and ruff check . --fix)
fix: add metadata for AutoRAGRetrieval
change link for markers_bm
add AutoRAGRetrieval to __init__.py and update metadata
add precise metadata
update metadata: description and license
delete descriptive_stats in AutoRAGRetrieval.py and run calculate_matadata_metrics.py (f79d9ba)
fix: make samples_per_label a task attribute (#1419)

make samples_per_label a task attr (7f1a1d3)

Unknown

Update tasks table (d069aba)

1.19.3

11 Nov 08:33

KennethEnevoldsen

1.19.3 (2024-11-11)

Documentation

docs: Fix a typo in README (#1430)

Fix typo in readme (9681eb3)

docs: Update recommendation for pushing results (#1401)

fix: Update recommendation for pushing results (fccf034)

Fix

fix: add logging for RetrievalEvaluator NaN values for similarity scores (#1398)

Fixes #1389 (cc7a106)
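As an illustration of the kind of check this fix describes, the sketch below counts NaN entries in a matrix of similarity scores and logs a warning when any are found. It is a minimal, self-contained example and not the code from the pull request: the function name count_nan_scores, the logger name, and the nested-list input format are all assumptions made for illustration.

```python
import logging
import math

# Hypothetical logger name, chosen for this sketch only.
logger = logging.getLogger("retrieval_sketch")


def count_nan_scores(scores):
    """Count NaN entries in a nested list of similarity scores.

    Logs a warning when at least one NaN is found, so that silent
    numerical problems become visible in the evaluation logs.
    """
    nan_count = sum(1 for row in scores for s in row if math.isnan(s))
    if nan_count:
        logger.warning("Found %d NaN values in similarity scores", nan_count)
    return nan_count
```

A caller could run this check on each batch of scores before ranking, and decide separately how to treat the affected entries.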

1.19.2

07 Nov 21:30

KennethEnevoldsen

1.19.2 (2024-11-07)

Fix

fix: Added the necessary trust_remote_code (#1406) (fd8b283)

1.19.1

07 Nov 13:53

KennethEnevoldsen

1.19.1 (2024-11-07)

Fix

fix: Add the_ugly_duckling.txt for speedtask to Python wheel (#1402)

Add the_ugly_duckling.txt for speedtask to Python wheel (b1a0ec6)

1.19.0

06 Nov 13:23

KennethEnevoldsen

1.19.0 (2024-11-06)

Feature

feat: Standardize descriptive stats (#1375)
init
mock tests
remove debug prints
remove descriptive stats and move to n_samples
fix typo
fix create task table
fix citations
remove n_samples
metadata per task
add test
reformat task table
add n_samples property
fix tests
rename total_symbols to number_of_characters
Update mteb/abstasks/AbsTaskRetrieval.py
rename in tests
lint

Co-authored-by: Kenneth Enevoldsen (2854fa2)

Unknown

Update tasks table (537b974)

1.18.9

06 Nov 10:41

KennethEnevoldsen

1.18.9 (2024-11-06)

Fix

fix: Disable rich output with verbosity=0 on evaluation.run (#1395)
feat: verbosity=0 now suppresses rich console output
fix: removed unnecessary list comprehension (1bb1ca3)

1.18.8

04 Nov 23:37

KennethEnevoldsen

1.18.8 (2024-11-04)

Fix

fix: Update logging verbosity levels in MTEB (#1384)
Fix verbosity handling in MTEB.py for consistent logging
updates
update docstrings
linting code (35daf58)
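One common way to make verbosity handling consistent is to map each integer verbosity value to a single standard logging level in one place. The sketch below shows that pattern. The mapping itself, the function name configure_logging, and the logger name mteb_sketch are assumptions for illustration and are not taken from MTEB.py.

```python
import logging

# Hypothetical mapping from an integer verbosity to a logging level.
_VERBOSITY_TO_LEVEL = {
    0: logging.CRITICAL,  # quietest setting
    1: logging.WARNING,
    2: logging.INFO,
    3: logging.DEBUG,     # most detailed setting
}


def configure_logging(verbosity: int, logger_name: str = "mteb_sketch") -> int:
    """Set the named logger's level from a verbosity value and return it."""
    if verbosity not in _VERBOSITY_TO_LEVEL:
        raise ValueError(f"verbosity must be one of {sorted(_VERBOSITY_TO_LEVEL)}")
    level = _VERBOSITY_TO_LEVEL[verbosity]
    logging.getLogger(logger_name).setLevel(level)
    return level
```

Keeping the mapping in a single dictionary means every caller resolves the same verbosity value to the same level.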

1.18.7

04 Nov 14:14

KennethEnevoldsen

1.18.7 (2024-11-04)

Fix

fix: Leaderboard UI improvements (#1370)
Added elementary search bar with RegEx functionality
Improved layout
Table improvements, adjusted column width and added links
Update benchmark filter description
Added instructions on multiple search queries

Co-authored-by: Isaac Chung (92fe9cb)
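A search bar with RegEx support and multiple queries can be sketched as a filter over model names. This is an illustrative example only, not the leaderboard's implementation: the function name search_models, the comma as query separator, and case-insensitive matching are all assumptions. Because of the comma separator, patterns that themselves contain a comma (such as a{1,2}) are not supported in this sketch.

```python
import re


def search_models(names, query):
    """Return the names that match at least one regex in a comma-separated query.

    An empty query returns every name unchanged.
    """
    patterns = [
        re.compile(part.strip(), re.IGNORECASE)
        for part in query.split(",")
        if part.strip()
    ]
    if not patterns:
        return list(names)
    return [name for name in names if any(p.search(name) for p in patterns)]
```

For example, the query "e5, ^BAAI" keeps names that contain "e5" anywhere or that start with "BAAI".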

1.18.6

31 Oct 08:33

KennethEnevoldsen

1.18.6 (2024-10-31)

Fix

fix: Integrate prompts to task metadata (#1300)
init
add DatasetDict
add classification
add clustering
add pair classification
add retrieval
add all prompts
start integrating prompts
refactor instruct models
lint
fix test
fix
fix no prompt in prompt dict
add more logging
add more logging
Apply suggestions from code review
fix code review
fix use_instructions
add log if instruction template not set
fix metadata
lint
fix brazilian
remove MetadataDatasetDict
rollback test metadata

Co-authored-by: Isaac Chung (029d378)

1.18.5

31 Oct 07:35

KennethEnevoldsen

1.18.5 (2024-10-31)

Fix

fix: Speed up leaderboard by caching and skipping validation (#1365)
Made loading and filtering faster by removing unnecessary validation
Made select_tasks faster by removing validation
Added caching to leaderboard
Ran linting
Added missing future import (f1bc375)
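The caching idea can be illustrated with the standard library's functools.lru_cache, which memoizes a function so that repeated requests with the same argument skip the expensive work. This is a generic sketch and not the leaderboard code: load_results, its return value, and the CALLS counter are hypothetical stand-ins for an expensive loading step.

```python
from functools import lru_cache

# Counts how often the expensive body actually runs (for demonstration only).
CALLS = {"count": 0}


@lru_cache(maxsize=32)
def load_results(benchmark_name: str) -> tuple:
    """Pretend to load results for a benchmark; cached per benchmark name."""
    CALLS["count"] += 1
    # A real loader would read and parse result files here.
    return (benchmark_name, "results")
```

Calling load_results twice with the same name runs the body once. The second call returns the cached object, which is the effect a leaderboard needs when users switch between the same benchmarks repeatedly.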
