Evaluating the performance of multi-object tracking (MOT) methods is not straightforward, and existing measures fail to consider all available uncertainty information in MOT context. This can lead practitioners select models which produce estimates lower quality, negatively impacting any downstream systems that rely on them. Additionally, most have hyperparameters, makes comparisons different t...