MOL-2607/ feat: implement random-k selector #10

emalgorithm · 2025-04-08T15:18:30Z

Summary

Implements a new RandomSelector class that selects the best value from k randomly chosen values in the input array. This makes it easier to benchmark confidence models, since the performance obtained with a RandomSelector is the same as the performance a TopSelector would obtain with a random confidence model.

Changes

Added RandomSelector class to selector.py
Added unit test for the new selector in test_selectors.py

Implementation Details

RandomSelector randomly selects k indices from the input array
Returns either the minimum or maximum value from the selected subset depending on the smaller_is_better parameter
Test verifies that the selector, when run multiple times, produces results with an average close to the expected value
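
For readers without the diff at hand, the behaviour described above can be sketched roughly as follows. This is not the code merged in this PR: the class name RandomSelectorSketch is hypothetical, the clipping of k to the array length is an assumption, and the sketch does not subclass peppr's Selector base class.

```python
import numpy as np


class RandomSelectorSketch:
    """Pick the best value among k randomly chosen entries (illustrative only)."""

    def __init__(self, k: int) -> None:
        self._k = k
        # Unseeded generator: every instance draws fresh entropy from the OS.
        self._rng = np.random.default_rng()

    def select(self, values, smaller_is_better: bool) -> float:
        values = np.asarray(values, dtype=float)
        # Assumption: if fewer than k values are available, use all of them.
        size = min(self._k, len(values))
        # Draw distinct indices, so no value is considered twice.
        indices = self._rng.choice(len(values), size=size, replace=False)
        subset = values[indices]
        # "Best" means the minimum or the maximum, depending on the metric.
        return float(subset.min() if smaller_is_better else subset.max())
```

With k at least as large as the number of values, the sketch reduces to a plain minimum or maximum over the whole array, which is a convenient sanity check.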

linear · 2025-04-08T15:18:33Z

MOL-2607 feat: implement random-k selector in Peppr

padix-key

Hi, thanks for adding the selector. I added some comments below.

padix-key · 2025-04-09T07:14:32Z

src/peppr/selector.py

+        The best value is chosen from *k* randomly chosen predictions.
+    """
+
+    def __init__(self, k: int) -> None:


I think it would be good to be able to provide an optional seed to the constructor that would be used to initialize a NumPy Generator.

Good point, addressed in ce196e1!
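
One way such a seed could be wired in is shown below. This is only a sketch of the general pattern, not the change made in ce196e1; the class name SeededDraw and its draw method are hypothetical. The only NumPy calls used are np.random.default_rng and Generator.choice.

```python
from typing import Optional

import numpy as np


class SeededDraw:
    """Hold a private NumPy Generator that is optionally seeded (hypothetical helper)."""

    def __init__(self, k: int, seed: Optional[int] = None) -> None:
        self._k = k
        # seed=None falls back to fresh OS entropy; an integer makes draws repeatable.
        self._rng = np.random.default_rng(seed)

    def draw(self, n: int) -> np.ndarray:
        # k distinct indices out of range(n), clipped to n if k is larger.
        return self._rng.choice(n, size=min(self._k, n), replace=False)


# Two instances created with the same seed produce the same sequence of draws.
first = SeededDraw(k=5, seed=42)
second = SeededDraw(k=5, seed=42)
same_first_draw = np.array_equal(first.draw(10), second.draw(10))
same_second_draw = np.array_equal(first.draw(10), second.draw(10))
```

Keeping the Generator on the instance, rather than calling np.random.seed, avoids touching NumPy's global random state.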

padix-key · 2025-04-09T07:16:01Z

tests/test_selectors.py

+    Check that the :class:`RandomSelector`, when ran multiple times, selects approximately the expected value for known
+    examples.
+    """
+    selector = peppr.RandomSelector(k=5)


The random seed could be used in this test to make it deterministic.

Addressed in af4a29a.

padix-key · 2025-04-09T07:18:18Z

tests/test_selectors.py

+        selector.select(values, smaller_is_better=False) for _ in range(20)
+    ]
+
+    assert np.isclose(np.mean(selected_values), 5.0, rtol=0.5)


Currently this test fails, probably because sometimes the mean is randomly not within the tolerance. Using a much larger k (and more values to select from) would solve this, right?

Addressed in af4a29a. I was actually computing the wrong expected value.
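
For context, the expected value of a best-of-k selection can be computed exactly instead of estimated. If the n values are sorted in ascending order and k of them are drawn without replacement, the value at 0-based position i is the maximum of the draw with probability C(i, k-1) / C(n, k). The sketch below applies this; the function name expected_best_of_k is hypothetical and the numbers are not the ones used in the PR's test.

```python
from math import comb


def expected_best_of_k(values, k: int, smaller_is_better: bool = False) -> float:
    """Exact expectation of the best value among k distinct random draws."""
    # Sort so that the "best" candidates come last.
    ordered = sorted(values, reverse=smaller_is_better)
    n = len(ordered)
    if not 1 <= k <= n:
        raise ValueError("k must be between 1 and the number of values")
    # The value at position i is the best of the draw if the other k-1
    # picks all come from the i values sorted before it.
    weighted = sum(v * comb(i, k - 1) for i, v in enumerate(ordered))
    return weighted / comb(n, k)


# Example with hypothetical inputs: the values 1..10 and k=5.
example = expected_best_of_k(range(1, 11), k=5)
```

For the special case of the values 1..n the sum collapses to k(n+1)/(k+1), so the example above evaluates to 55/6, noticeably higher than the plain mean of 5.5.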

padix-key · 2025-04-09T10:59:04Z

Looks good to me now, thanks!

emalgorithm added 2 commits April 8, 2025 17:12

feat: implement selector

21e0eb8

test: random selector

da115d9

fix: import bug

7bd63bd

emalgorithm marked this pull request as ready for review April 8, 2025 15:22

emalgorithm requested a review from padix-key April 8, 2025 15:22

fix: more fixes

76625c9

padix-key reviewed Apr 9, 2025

padix-key changed the title ~~MOL-2607/ feat: implement random-k selector in Peppr~~ MOL-2607/ feat: implement random-k selector Apr 9, 2025

emalgorithm added 3 commits April 9, 2025 10:52

feat: add random seed to selector

ce196e1

test: random selector

af4a29a

chore: add seed to docstring

d7187c2

emalgorithm requested a review from padix-key April 9, 2025 08:55

padix-key approved these changes Apr 9, 2025

emalgorithm merged commit c7f0a7a into main Apr 9, 2025
6 checks passed

emalgorithm deleted the mol-2607-feat-implement-random-k-selector-in-peppr branch April 9, 2025 10:59
