Towards a statistical theory of data selection under weak supervision

Germain Kolossov,Andrea Montanari,Pulkit Tandon

Given a sample of size $N$, it is often useful to select a subsample of smaller size $n