Ground Truth
Abstract
The concept of ground truth is a well-established principle in cartography, where data collected at a distance are confirmed by measurements made on location. Those local measurements are used to calibrate remote sensing devices, verify or correct experimental inferences, and update geographic databases. Ground truth observations also provide a means of training and supervising image classification software and resolving errors of omission or commission. Cartographic methods have improved significantly because of the development of precise positioning methods (GPS), the development of interoperable data standards for rapid exchange of precise and highly interlinked information, and the development of various devices and visualizations that serve up information on demand to different classes of end-users. There are strong parallels between mapping geographical and biological space. Much of what is underway in genomics, evolutionary biology and systems biology is analogous to the development of a coordinate system onto which living systems can be mapped, natural boundaries and interrelationships uncovered, and predictions of properties and behaviors based. However, it is likely that the dimensionality of any biological coordinate system will exceed the four dimensions of the geographical system. This will confound visualization and complicate "navigation" through biological space, whether it is for purely exploratory purposes or to get from one point to another. It is a given that the volume of biological data will continue to grow super-linearly for the foreseeable future, as new computational methods are applied to answer the "big questions" in biology. In the absence of major innovation, it is likely that the gap between the cost of data analysis and the cost of data generation will continue to widen.
The outcomes of such analyses are highly dependent on the quality of the input data, including the underlying information and knowledge used to inform the creation of datasets, the algorithms used in analyses, and the interpretation. Errors of commission and omission appear to be more common in biological data sets than in physical data sets, and the former are more likely to be affected by semantic ambiguity and hidden biases. What is not yet established is which of the labor-intensive curatorial and interpretive tasks can be automated and what metadata that is absent from the public databases may be located and recovered from other sources in a usable form. The impact of semantic ambiguity in biological data has been noted previously as it pertains to identifiers [1] or biological names …
Similar resources
University of Dublin TRINITY COLLEGE Ground truth specification for video
Ground truth is used to establish a base level of information about what is in an image so we can evaluate image detection and classification algorithms. However, in the field of computer vision there is currently very little ground truth available for video and no consensus on what form that ground truth should take. Often ground truth is created solely to test a particular technique which rest...
Applying a climatologically oriented GIS in comparison of TRMM estimated severe thunderstorm rainfalls with ground truth in Sydney metropolitan area
The main objective of the current research was to compare severe thunderstorm rainfalls with TRMM data when flash flooding events were observed in the Sydney Metropolitan Area (SMA), located in NSW, Australia. Severe thunderstorm rainfall events were first extracted from the severe storm archive of the Australian BOM, by induction of specific criteria. The corresponding derived dataset includ...
Effect of Errors in Ground Truth on Classification Accuracy
The effect of errors in ground truth on the estimated thematic accuracy of a classifier is considered. A relationship is derived between the true accuracy of a classifier relative to ground truth without errors, the actual accuracy of the ground truth used, and the measured accuracy of the classifier as a function of the number of classes. We show that if the accuracy of the ground truth is kno...
Semi-automatic Ground Truth Generation for Chart Image Recognition
While research on scientific chart recognition is being carried out, there is no suitable standard that can be used to evaluate the overall performance of the chart recognition results. In this paper, a system for semi-automatic chart ground truth generation is introduced. Using the system, the user is able to extract multiple levels of ground truth data. The role of the user is to perform veri...
Vehicle Ground-Truth Database for the Vertical-View Ft. Hood Imagery
This paper reports the work on building the ground-truth databases for the vehicles in the vertical-view Ft. Hood (VVFH) image data set. We briefly describe the protocols followed in manual annotation of the images, how the ground-truth information is inferred from the annotated image data, and the major entities in the ground-truth database. The vehicle detection performance of a few algorithm...
Training Deep Learning based Denoisers without Ground Truth Data
Recent deep learning based denoisers are trained to minimize the mean squared error (MSE) between the output of a network and the ground truth noiseless image in the training data. Thus, it is crucial to have high quality noiseless training data for high performance denoisers. Unfortunately, in some application areas such as medical imaging, it is expensive or even infeasible to acquire such a ...