-
Notifications
You must be signed in to change notification settings - Fork 240
Issues: embeddings-benchmark/mteb
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
PreTrainedTokenizerFast._batch_encode_plus() got an unexpected keyword argument 'prompt_name'
#1224
opened Sep 18, 2024 by
violenil
Figure out an approach for adding experiments to the leaderboard
#1211
opened Sep 9, 2024 by
KennethEnevoldsen
Add benchmark overview table
documentation
Improvements or additions to documentation
good first issue
Good for newcomers
#1209
opened Sep 9, 2024 by
KennethEnevoldsen
Evaluating only on English
question
Further information is requested
#1205
opened Sep 8, 2024 by
stephantul
Re-add CQADupstack merge script / find other way to make merging easier
#1171
opened Aug 20, 2024 by
Muennighoff
Add updated Touche-2020
enhancement
New feature or request
good first issue
Good for newcomers
#1170
opened Aug 20, 2024 by
orionw
[New dataset request] Please add MKQA
good first issue
Good for newcomers
new-dataset
#1149
opened Aug 11, 2024 by
PrithivirajDamodaran
Solution for transforming retrieval datasets into parquet
#1090
opened Jul 15, 2024 by
gowitheflow-1998
Implements check on existing and new datasets
enhancement
New feature or request
#1049
opened Jul 5, 2024 by
KennethEnevoldsen
Implements a custom for documents to make data format consistent
enhancement
New feature or request
#1047
opened Jul 5, 2024 by
KennethEnevoldsen
Update prints to use a consistent format:
good first issue
Good for newcomers
help wanted
Extra attention is needed
#1046
opened Jul 5, 2024 by
KennethEnevoldsen
Leaks and duplications in the MTEB leaderboard
bug
Something isn't working
#1036
opened Jul 3, 2024 by
lbourdois
Confusion re: Retrieval w/Instructions
leaderboard
issues related to the leaderboard
#1013
opened Jun 30, 2024 by
Muennighoff
Paper Writing: Speeding up the benchmark [waiting for retrieval]
#1007
opened Jun 28, 2024 by
KennethEnevoldsen
extracting embedding from other than CLS token during eval mteb
#999
opened Jun 28, 2024 by
riyajatar37003
Option to remove cached dataset files on large runs
enhancement
New feature or request
#984
opened Jun 25, 2024 by
isaac-chung
Be able to cache embeddings and load them
enhancement
New feature or request
good first issue
Good for newcomers
#946
opened Jun 17, 2024 by
orionw
Allow for model specific similarity in Bitext Mining and Retrieval
enhancement
New feature or request
#943
opened Jun 17, 2024 by
KennethEnevoldsen
Avoid using global seeds
enhancement
New feature or request
#942
opened Jun 17, 2024 by
KennethEnevoldsen
Previous Next
ProTip!
Exclude everything labeled
bug
with -label:bug.