raises error when dataset is an empty list in NanoBEIREvaluator #3122

JINO-ROHIT · 2024-12-08T14:13:54Z

Raises an error when the dataset names are an empty list in NanoBEIREvaluator.

Who can review?
@tomaarsen

Copilot reviewed 1 out of 1 changed files in this pull request and generated no suggestions.

tomaarsen · 2024-12-09T10:10:16Z

sentence_transformers/evaluation/NanoBEIREvaluator.py

@@ -420,6 +420,8 @@ def _load_dataset(self, dataset_name: DatasetNameType, **ir_evaluator_kwargs) ->
        )

    def _validate_dataset_names(self):
+        if self.dataset_names == []:


Suggested change

-        if self.dataset_names == []:
+        if len(self.dataset_names) == 0:

This is a personal preference, but I prefer checking lengths like this :)
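
As an aside to the suggestion above, the two checks are not quite equivalent. The sketch below is illustrative and not part of the PR; the helper names `is_empty_by_equality` and `is_empty_by_length` are hypothetical. Comparing against `[]` only matches an empty list, while the length check matches any empty sized container, such as a tuple.

```python
def is_empty_by_equality(names):
    # True only when `names` compares equal to an empty list.
    return names == []


def is_empty_by_length(names):
    # True for any sized container with no elements (list, tuple, ...).
    return len(names) == 0


for candidate in ([], (), [""], ["msmarco"]):
    print(
        repr(candidate),
        is_empty_by_equality(candidate),
        is_empty_by_length(candidate),
    )
```

For a plain list, which is what the diff checks, both forms behave the same, so the choice is mostly stylistic.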

On second thought, there's also the case where datasets = [''], a list containing a single empty string.

Currently it is reported as an invalid dataset:

ValueError: Dataset(s) [''] not found in the NanoBEIR collection.Valid dataset names are: ['climatefever', 'dbpedia', 'fever', 'fiqa2018', 'hotpotqa', 'msmarco', 'nfcorpus', 'nq', 'quoraretrieval', 'scidocs', 'arguana', 'scifact', 'touche2020']

Would it be nicer to treat it as an empty dataset list and cover it with this check instead?

I don't think we have to worry about this case. I doubt many people would use [""], and if they do, the error is already pretty descriptive.
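
The behavior discussed in this thread can be sketched as a standalone function. This is a minimal illustration, not the library's actual implementation: the function name `validate_dataset_names`, the module-level `VALID_DATASET_NAMES` list (copied from the error message quoted above), the wording of the empty-list error, and the case-sensitive membership test are all assumptions.

```python
# Names copied from the error message quoted in this thread.
VALID_DATASET_NAMES = [
    "climatefever", "dbpedia", "fever", "fiqa2018", "hotpotqa", "msmarco",
    "nfcorpus", "nq", "quoraretrieval", "scidocs", "arguana", "scifact",
    "touche2020",
]


def validate_dataset_names(dataset_names):
    # New check from this PR: reject an empty list up front.
    if len(dataset_names) == 0:
        # Hypothetical message; the real wording may differ.
        raise ValueError("dataset_names cannot be empty.")

    # Existing check: every name must be a known dataset.
    # An empty string such as "" falls through to this branch.
    missing = [name for name in dataset_names if name not in VALID_DATASET_NAMES]
    if missing:
        raise ValueError(
            f"Dataset(s) {missing} not found in the NanoBEIR collection. "
            f"Valid dataset names are: {VALID_DATASET_NAMES}"
        )


for names in ([], [""], ["msmarco"]):
    try:
        validate_dataset_names(names)
        print(names, "-> ok")
    except ValueError as err:
        print(names, "->", err)
```

With this shape, `[]` is caught by the new check, while `[""]` still reaches the existing "not found" error, which matches the decision reached in the review.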

tomaarsen · 2024-12-10T09:39:23Z

Thanks again!

Tom Aarsen

raises error when dataset is an empty list in NanoBEIREvaluator

406531d

tomaarsen requested a review from Copilot December 9, 2024 09:31

Copilot AI reviewed Dec 9, 2024

tomaarsen reviewed Dec 9, 2024

fix len

71ea920

tomaarsen merged commit 58d68ac into UKPLab:master Dec 10, 2024
9 checks passed

tomaarsen mentioned this pull request Dec 10, 2024

[typo] Add missing space between sentences in error message #3125

Merged
