Confidence Measurement Metrics in Multimodal Large Language Models for Ultrasound-Based Radiology Cases: Comparative Evaluation Study of Self-Reported, Consistency-Based, and Hybrid Methods
Background: Large language models (LLMs) require specialized methodologies to quantify model confidence for safe deployment in health care systems; however, the...