First Person Vision (Egocentric) video analysis stands nowa-days as one of the emerging fields in computer vision. The availability of wearable devices recording exactly what the user is looking at is ineluctable and the opportunities and chal-lenges carried by this kind of devices are broad. Particularly, for the first time a device is so intimate with the user to be able to record the movements of his hands, making hand-based ap-plications for First Person Vision one the most explored area in the field. This paper explores the more popular processing steps to develop hand-based applications, and proposes a hi-erarchical structure that optimally switches between each of the levels to reduce the computational cost of the system and improve ...