Repository logo
 

Towards a Label-Free and Representation-Based Metric for Evaluating Machine Learning Models

Date

2022-08-04

Authors

Xu, Isaac

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

In this work, we explore the viability of proposed label-free metrics to evaluate models. We begin by examining the effect on linear probe accuracy which different viable label schemes on an identical dataset may cause. We show that in a toy setting, a notion of “complexity” for distinguishing classes can have predictive capabilities for anticipating relative “difficulty” a population of models may encounter for a comparison between classification tasks. In establishing these arbitrary relative differences in valid formulations for an evaluation task, we justify the search for a label scheme independent means to evaluate learning. To this end, we examine label-free clustering-based metrics and entropy on representational spaces at progressive milestones during self-supervised learning and on pre-trained representational spaces. While clustering-based metrics show mixed success, entropy may be viable for monitoring learning and cross-architectural comparisons, despite displaying instability in early training and showing differing trends for certain learning methodologies.

Description

The work presented looks to gain insight into notions of complexity and difficulty for a task by conducting an exercise in predicting the degree of difficulty a population of models may have on arbitrary classification tasks for a toy dataset. In establishing the differing model evaluation results from these tasks, an argument is made for a label-free means to evaluate models. Methods for evaluating learning without labels such as a clustering-based metrics and entropy are examined. Entropy was found to be the most effective measure to evaluate learning, but issues pertaining to learning methodology and early learning instability require further study.

Keywords

Machine Learning, Self-Supervised Learning, Clustering, Complexity, Information Theory

Citation