Researchers at Arizona State University have developed a new technology that incorporates batch mode active learning systems. This method selects a batch of unlabeled data points simultaneously from a given body of unlabeled data as opposed to the pool based method which selects only one at a time. The classifier is retrained once after every batch of data points is selected and labeled. The selection of multiple instances facilitates parallel labeling increasing efficiency and productivity. The proposed technology improves on current models by simultaneously solving for both the batch size as well as the specific data batch to be classified. The batch size and data are determined based on projected improvements in the classifier?s efficiency in classifying unlabeled data.
Potential Applications
Benefits and Advantages