version française rss feed
HAL : in2p3-00580588, version 1

Fiche détaillée  Récupérer au format
OPT 2009: 2nd NIPS Workshop on Optimization for Machine Learning, Whistler : Canada (2009)
Bandit-Aided Boosting
R. Busa-Fekete1, 2, B. Kégl1, 2, 3

In this paper we apply multi-armed bandits (MABs) to accelerate ADABOOST. ADABOOST constructs a strong classifier in a stepwise fashion by selecting simple base classifiers and using their weighted "vote" to determine the final classification. We model this stepwise base classifier selection as a sequential decision problem, and optimize it with MABs. Each arm represent a subset of the base classifier set. The MAB gradually learns the "utility" of the subsets, and selects one of the subsets in each iteration. ADABOOST then searches only this subset instead of optimizing the base classifier over the whole space. The reward is defined as a function of the accuracy of the base classifier. We investigate how the MAB algorithms (UCB, UCT) can be applied in the case of boosted stumps, trees, and products of base classifiers. On benchmark datasets, our bandit-based approach achieves only slightly worse test errors than the standard boosted learners for a computational cost that is an order of magnitude smaller than with standard ADABOOST.
1 :  LAL - Laboratoire de l'Accélérateur Linéaire
2 :  LRI - Laboratoire de Recherche en Informatique
3 :  INRIA Saclay - Ile de France - TAO
Informatique/Performance et fiabilité

Informatique/Algorithme et structure de données
Liste des fichiers attachés à ce document : 
OPT2009-BusaFekete.pdf(454.5 KB)