Yifei Huang, PhD

Project assistant professor @ UTokyo

Publications

You may also find my articles on my Google Scholar profile.

International Publications

  • Y. Yin, Y. Huang, R. Furuta and Y. Sato, " Proposal-based Temporal Action Localization with Point-level Supervision," British Machine Vision Conference (BMVC), November 2023.
  • J. Wang, G. Chen, Y. Huang, L. Wang and T. Lu, " Memory-and-Anticipation Transformer for Online Action Understanding," Proceedings of International Conference on Computer Vision (ICCV), October 2023.
  • Y. Huang, L. Yang and Y. Sato, "Weakly Supervised Temporal Sentence Grounding with Uncertainty-Guided Self-training," Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2023.
  • Q. Zhang, G. Barbareschi, Y. Huang, J. Li, Y.S. Pai, J. Ward, K. Kai, “Seeing our Blind Spots: Smart Glasses-based Simulation to Increase Design Students Awareness of Visual Impairment,” in ACM Symposium on User Interface Software and Technology (UIST) , November 2022.
  • Y. Huang, L. Yang and Y. Sato, “Compound Prototype Matching for Few-shot Action Recognition,” to appear in European Conference on Computer Vision (ECCV), October 2022. [Code]
  • L. Yang, Y. Huang*, Y. Sugano, and Y. Sato, “Interact before Align: Leveraging Cross-Modal Knowledge for Domain Adaptive Action Recognition,” in IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), June 2022.
  • Kristen Grauman, et al., “Ego4D: Around the World in 3,000 Hours of Egocentric Video,” in IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), June 2022.
  • Y. Huang, X. Li, L. Yang, L. Gu, Y. Zhu, H. Seo, Q. Meng, T. Harada, and Y. Sato, “Leveraging Human Selective Attention for Medical Image Analysis with Limited Training Data,” British Machine Vision Conference (BMVC), November 2021. [Paper]
  • L. Yang, Y. Huang*, Y. Sugano, and Y. Sato, “Stacked Temporal Attention: Improving First-person Action Recognition by Emphasizing Discriminative Clips,” in Proc. British Machine Vision Conference (BMVC), November 2021. [Paper]
  • L. Yang, Y. Huang, Y. Sugano, and Y. Sato, “EPIC-KITCHENS-100 Unsupervised Domain Adaptation Challenge for Action Recognition 2021: Team M3EM Technical Report,” The Eighth International Workshop on Egocentric Perception, Interaction and Computing (EPIC), June 2021. (Second Place Winner) [Paper]
  • Z. Li, W. Wang, Z. Li, Y. Huang, and Y. Sato. “Spatio-Temporal Perturbations for Video Attribution,” IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2021. [Paper]
  • H. Chen, Y. Huang, H.Takamura, and H. Nakayama. “ Commonsense Knowledge Aware Concept Selection for Diverse and Informative Visual Storytelling.” AAAI Conference on Artificial Intelligence, (AAAI), 2021. [Paper]
  • Y. Liu, L. Zhou, X. Bai, Y. Huang, L. Gu, J. Zhou, T. Harada. “Goal-Oriented Gaze Estimation for Zero-Shot Learning.” Proceedings of IEEE/CVF Conference on Computer Vision and Pattern Recognition, (CVPR), 2021. [Paper]
  • F. Drigalski, K. Hayashi, Y. Huang, R. Yonetani, M. Hamaya, K. Tanaka, Y. Ijiri. “Precise Multi-Modal In-Hand Pose Estimation using Low-Precision Sensors for Robotic Assembly. “ IEEE International Conference on Robotics and Automation (ICRA), 2021. [Paper] [Video]
  • Z. Li, W. Wang, Z. Li, Y. Huang, and Y. Sato. “Towards Visually Explaining Video Understanding Networks with Perturbation.” Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2021. [Paper]
  • Y. Huang, M. Cai, and Y. Sato. “An Ego-Vision System for Discovering Human Joint Attention.” IEEE Transactions on Human-Machine Systems (THMS) 2020.
  • Y. Huang, M. Cai, Z. Li, F. Lu, and Y. Sato. “Mutual Context Network for Jointly Estimating Egocentric Gaze and Action.” IEEE Transactions on Image Processing (TIP), 2020.
  • G. Wu, Y. Zheng, Z. Guo, Z. Cai, X. Shi, X. Ding, Y. Huang, Y. Guo, R. Shibasaki. “Learn to Recover Visible Color for Video Surveillance in a Day.” In Proceedings of the European Conference on Computer Vision (ECCV), 2020. (oral, 3%) [Paper]
  • Y. Huang, Y. Sugano and Y. Sato, "Improving Action Segmentation via Graph-based Temporal Reasoning," Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2020. [Code] [Paper]
  • Z. Li, Y. Huang, M. Cai, and Y. Sato, "Manipulation-skill Assessment from Videos with Spatial Attention Network," Proceedings of IEEE International Conference on Computer Vision Workshop (ICCVW), Nov 2019. [Paper]
  • H. Chen, Y. Huang\*, and H. Nakayama, "Semantic Aware Attention Based Deep Object Co-segmentation," Proceedings of Asian Conference on Computer Vision (ACCV), 2018.
    [Paper]
  • Y. Huang, M. Cai, Z. Li, and Y. Sato, "Predicting gaze in egocentric videos by learning task-dependent attention transition," Proceedings of European Conference on Computer Vision (ECCV), 2018. (oral presentation, acceptance rate: 2%)
    [project] [Paper]
  • Y. Huang, M. Cai, H. Kera, R. Yonetani, K. Higuchi, and Y. Sato, "Temporal localization and spatial segmentation of joint attention in multiple first-person videos," Proceedings of IEEE International Conference on Computer Vision Workshop (ICCVW), pp. 2313-2321, Oct 2017.
    [project] [paper] [poster]

Domestic Publications

  • L. Yang, Y. Huang, Y.Sugano and Y. Sato, "Egocentric Action Recognition from Noisy Videos," Pattern Recognition and Machine Understanding (PRMU), extended abstract, Mar 2020.
  • L. Yang, Y. Huang, and Y. Sato, "Egocentric Action Recognition using Graph Convolutional Network," Meeting on Image Recognition and Understanding (MIRU), extended abstract, Aug 2019.
  • Z. Li, Y. Huang, M. Cai, and Y. Sato, "Manipulation-skill Assessment from Videos with Spatial Attention Network," Meeting on Image Recognition and Understanding (MIRU), extended abstract, Aug 2019.
  • Y. Huang, M. Cai, Z. Li, and Y. Sato, "Mutual Context Network for Jointly Estimating Egocentric Gaze and Actions," Meeting on Image Recognition and Understanding (MIRU), extended abstract, Aug 2019.
  • Z. Li, Y. Huang, M. Cai, and Y. Sato, "Paiwise performance assessment using deep ranking," Meeting on Image Recognition and Understanding (MIRU), extended abstract, Aug 2018.
  • Y. Huang, M. Cai, Z. Li, and Y. Sato, "Egocentric gaze prediction using task-dependent attention transition," Meeting on Image Recognition and Understanding (MIRU), extended abstract, Aug 2018.
  • Y. Huang, M. Cai, H. Kera, R. Yonetani, K. Higuchi, and Y. Sato, "Spatial-temporal segmentation of joint attention in multiple first-person videos," Meeting on Image Recognition and Understanding (MIRU), extended abstract, Aug 2017.