Action recognition and scene understanding from videos