Abstract 问题: 1)evaluation is often not suffcient 2)biased for certain types of algorthms 3)datasets do not have common ground-truth object positions or extents 4)the initial conditions or parameters of the evaluated tracking algorithms are not the sa