Redirecting to original paper in 30 seconds...
Click below to go immediately or wait for automatic redirect
Investigates the effectiveness of using uncertainty scores derived from softmax distributions as signals for unsupervised ranking of Large Multimodal Models (LMMs). Demonstrates that these uncertainty scores provide a robust and consistent basis for ranking models across various Visual Question Answering (VQA) tasks, enabling model selection without ground-truth labels.
Significantly reduces the cost and effort required to select the best performing LMM for a given task, accelerating development cycles and improving the efficiency of AI deployments. Enables users to choose models without needing extensive labeled evaluation datasets.