Interpretation of human actions
We will identify the objects and other elements in the human's surroundings and determine how they relate to the human's task, interaction, and overall experience. To enable robot learning from visual observation, human actions must first be “detected”. Here, detection means estimating the temporal boundaries (start and end) and the label of each action in third-person videos of humans.
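To make the notion of detection concrete, the following minimal sketch shows one possible form of its output: a list of labeled segments with start and end times. It assumes that some per-frame action classifier (not specified here) has already assigned a label to every video frame; the names `ActionSegment` and `segments_from_frame_labels`, the `"background"` label, and the example action labels are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from itertools import groupby
from typing import List, Sequence


@dataclass(frozen=True)
class ActionSegment:
    """One detected action: a label plus its temporal boundaries."""
    label: str
    start_s: float  # start time in seconds (inclusive)
    end_s: float    # end time in seconds (exclusive)


def segments_from_frame_labels(
    frame_labels: Sequence[str],
    fps: float,
    background: str = "background",  # assumed label for "no action"
) -> List[ActionSegment]:
    """Merge runs of identical per-frame labels into labeled segments."""
    if fps <= 0:
        raise ValueError("fps must be positive")

    segments: List[ActionSegment] = []
    frame_idx = 0
    # groupby yields one group per run of consecutive identical labels.
    for label, run in groupby(frame_labels):
        run_len = sum(1 for _ in run)
        if label != background:
            segments.append(
                ActionSegment(
                    label=label,
                    start_s=frame_idx / fps,
                    end_s=(frame_idx + run_len) / fps,
                )
            )
        frame_idx += run_len
    return segments


if __name__ == "__main__":
    # Hypothetical per-frame predictions for a 1-second clip at 10 fps.
    labels = ["background"] * 2 + ["reach"] * 3 + ["grasp"] * 5
    for seg in segments_from_frame_labels(labels, fps=10.0):
        print(seg)
```

Representing each action as a (label, start, end) triple keeps the detection result independent of the frame rate and of whichever model produced the per-frame labels.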