Program at a glance



Program

December 14
Oral 1-1 Award Session
89 TFM a Dataset for Detection and Recognition of Masked Faces in the Wild

Gibran Benitez-Garcia (The University of Electro-Communications)*; Miguel Jimenez-Martinez (INSTITUTO POLITECNICO NACIONAL); Jesus Olivares-Mercado (INSTITUTO POLITECNICO NACIONAL); Hiroki Takahashi (the University of Electro-Communications)
93 Deep Image and Kernel Prior Learning for Blind Super-Resolution

Kazuhiro Yamawaki (Yamaguchi University); Xian-Hua Han (Yamaguchi University)*
42 Asymmetric Label Propagation for Video Object Segmentation

Zhen Chen (Peking University); Ming Yang (Horizon Robotics); Shiliang Zhang (Peking University)*
39 Informative Sample-Aware Proxy for Deep Metric Learning

Aoyu Li (Tokyo Institute of Technology)*; Ikuro Sato (Tokyo Institute of Technology / Denso IT Laboratory); Kohta Ishikawa (Denso IT Laboratory, Inc.); Rei Kawakami (Tokyo Institute of Technology); Rio Yokota (Tokyo Institute of Technology)
83 Federated Knowledge Transfer for Heterogeneous Visual Models

Wenzhe Li (Tsinghua University)*; Zirui Zhu (Tsinghua University); Tianchi Huang (Tsinghua University); Lifeng Sun (Tsinghua University); Chun Yuan (Graduate school at ShenZhen, Tsinghua university)
Demo Spotlight
100 A Music Loop Sequencer with User-adaptive Music Loop Selection

Yuki Iwamoto (Nihon University)*; Tetsuro Kitahara (Nihon University)
105 Action Detection System based on Pose Information

Ryo Kawai (NEC Corporation)*; Noboru Yoshida (NEC Corporation); Jianquan Liu (NEC Corporation)
106 DeepHair: a DeepFake-based Hairstyle Preview System

Yu-Hsuan Lo (Taipei National University of the Arts); Shih-Wei Sun (Taipei National University of the Arts)*
107 Emotional Talking Faces: Making Videos More Expressive and Realistic

Sahil Goyal (IIT Roorkee)*; Shagun Uppal (IIIT-Delhi); Sarthak Bhagat (IIIT-Delhi); Dhroov Goel (IIITD); Sakshat Mali (Indraprastha Institute of Information Technology, Delhi); Yi Yu (NII); Yifang Yin (A*STAR); Rajiv Ratn Shah (IIIT Delhi)
109 FoodLog Athl: Multimedia Food Recording Platform for Dietary Guidance and Food Monitoring

Kei Nakamoto (The University of Tokyo)*; Kohei Kumazawa (The University of Tokyo ); Hiroaki KARASAWA (Hongo Software Development); Sosuke Amano (foo.log Inc.); Yoko Yamakata (University of Tokyo, Japan); Kiyoharu Aizawa (The University of Tokyo)
110 Rubber material retrieval system using electron microscope images for rubber material development

Rintaro Yanagi (Hokkaido University)*; Ren Togo (Hokkaido University); Takahiro Ogawa (Hokkaido University); Miki Haseyama (Hokkaido University)
104 JamSketch Deep α: A CNN-based Improvisation System in Accordance with User's Melodic Outline Drawing

Tetsuro Kitahara (Nihon University)*; Akio Yonamine (Nihon University)
108 GSTH266enc: A GStreamer plugin for VVC encoder

Advaiit Rajjvaed (InterDigital Inc.); Saurabh Puri (InterDigital Inc.)*; Gurdeep Bhullar (InterDigital Inc.); Gaelle Martin-Cocher (InterDigital Inc.)
101 Intelligent Video Surveillance Platform Based On FFmpeg And Yolov5

Chuanxu Jiang (HoHai university); Yanfang Wang (Hohai University); Qian Huang (Hohai University)*; Yiming Wang (HoHai university); Yuhan Dai (Hohai University)
Poster+Demo
100 A Music Loop Sequencer with User-adaptive Music Loop Selection

Yuki Iwamoto (Nihon University)*; Tetsuro Kitahara (Nihon University)
105 Action Detection System based on Pose Information

Ryo Kawai (NEC Corporation)*; Noboru Yoshida (NEC Corporation); Jianquan Liu (NEC Corporation)
106 DeepHair: a DeepFake-based Hairstyle Preview System

Yu-Hsuan Lo (Taipei National University of the Arts); Shih-Wei Sun (Taipei National University of the Arts)*
107 Emotional Talking Faces: Making Videos More Expressive and Realistic

Sahil Goyal (IIT Roorkee)*; Shagun Uppal (IIIT-Delhi); Sarthak Bhagat (IIIT-Delhi); Dhroov Goel (IIITD); Sakshat Mali (Indraprastha Institute of Information Technology, Delhi); Yi Yu (NII); Yifang Yin (A*STAR); Rajiv Ratn Shah (IIIT Delhi)
109 FoodLog Athl: Multimedia Food Recording Platform for Dietary Guidance and Food Monitoring

Kei Nakamoto (The University of Tokyo)*; Kohei Kumazawa (The University of Tokyo ); Hiroaki KARASAWA (Hongo Software Development); Sosuke Amano (foo.log Inc.); Yoko Yamakata (University of Tokyo, Japan); Kiyoharu Aizawa (The University of Tokyo)
110 Rubber material retrieval system using electron microscope images for rubber material development

Rintaro Yanagi (Hokkaido University)*; Ren Togo (Hokkaido University); Takahiro Ogawa (Hokkaido University); Miki Haseyama (Hokkaido University)
104 JamSketch Deep α: A CNN-based Improvisation System in Accordance with User's Melodic Outline Drawing

Tetsuro Kitahara (Nihon University)*; Akio Yonamine (Nihon University)
108 GSTH266enc: A GStreamer plugin for VVC encoder

Advaiit Rajjvaed (InterDigital Inc.); Saurabh Puri (InterDigital Inc.)*; Gurdeep Bhullar (InterDigital Inc.); Gaelle Martin-Cocher (InterDigital Inc.)
101 Intelligent Video Surveillance Platform Based On FFmpeg And Yolov5

Chuanxu Jiang (HoHai university); Yanfang Wang (Hohai University); Qian Huang (Hohai University)*; Yiming Wang (HoHai university); Yuhan Dai (Hohai University)
89 TFM a Dataset for Detection and Recognition of Masked Faces in the Wild

Gibran Benitez-Garcia (The University of Electro-Communications)*; Miguel Jimenez-Martinez (INSTITUTO POLITECNICO NACIONAL); Jesus Olivares-Mercado (INSTITUTO POLITECNICO NACIONAL); Hiroki Takahashi (the University of Electro-Communications)
93 Deep Image and Kernel Prior Learning for Blind Super-Resolution

Kazuhiro Yamawaki (Yamaguchi University); Xian-Hua Han (Yamaguchi University)*
42 Asymmetric Label Propagation for Video Object Segmentation

Zhen Chen (Peking University); Ming Yang (Horizon Robotics); Shiliang Zhang (Peking University)*
39 Informative Sample-Aware Proxy for Deep Metric Learning

Aoyu Li (Tokyo Institute of Technology)*; Ikuro Sato (Tokyo Institute of Technology / Denso IT Laboratory); Kohta Ishikawa (Denso IT Laboratory, Inc.); Rei Kawakami (Tokyo Institute of Technology); Rio Yokota (Tokyo Institute of Technology)
83 Federated Knowledge Transfer for Heterogeneous Visual Models

Wenzhe Li (Tsinghua University)*; Zirui Zhu (Tsinghua University); Tianchi Huang (Tsinghua University); Lifeng Sun (Tsinghua University); Chun Yuan (Graduate school at ShenZhen, Tsinghua university)
1 An End-to-End Scene Text Detector with Dynamic Attention

Jingyu Lin (厦门大学); Yan Yan (Xiamen University); Hanzi Wang (Xiamen University)*
50 Self-Attentive CLIP Hashing for Unsupervised Cross-Modal Retrieval

Heng Yu (Nanjing University of Science and Technology); Shuyan Ding (Nanjing University of Science and Technology)*; Lunbo Li ( Nanjing University of Science and Technology); Jiexin Wu (Nanjing university of Science and Technology )
66 Affective Embedding Framework with Semantic Representations from Tweets for Zero-shot Visual Sentiment Prediction

Yingrui Ye (Hokkaido University)*; Yuya Moroto (Hokkaido University); Keisuke Maeda (Hokkaido University); Takahiro Ogawa (Hokkaido University); Miki Haseyama (Hokkaido University)
16 SPEAKER VGG CCT: Cross-corpus Speech Emotion Recognition with Speaker Embedding and Vision

Alessandro Arezzo (University of Florence, Italy); Stefano Berretti (University of Florence, Italy)*
6 Robust Learning with Adversarial Perturbations and Label Noise: A Two-Pronged Defense Approach

Peng-Fei Zhang (University of Queensland)*; Zi Helen Huang (University of Queensland); Xin Luo (Shandong University); Pengfei Zhao (Shandong University)
70 Enhancing the Robustness of Deep Learning Based Fingerprinting to Improve Deepfake Attribution

Chieh-Yin Liao (National Taiwan University)*; Chen-hsiu Huang (National Taiwan University); Jun-Cheng Chen (Academia Sinica); Ja-Ling Wu (NTU)
65 Disentangled Image Attribute Editing in Latent Space via Mask-based Retention Loss

Shunya Ohaga (Hokkaido University)*; Ren Togo (Hokkaido University); Takahiro Ogawa (Hokkaido University); Miki Haseyama (Hokkaido University)
36 ObjectMix: Data Augmentation by Copy-Pasting Objects in Videos for Action Recognition

Jun Kimata (Nagoya Institute of Technology); Tomoya Nitta (Nagoya Insitute of Technology); Toru Tamaki (Nagoya Institute of Technology)*
20 CMR3D: Contextualized Multi-Stage Refinement for 3D Object Detection

Dhanalaxmi Gaddam (Mohammed Bin Zayed University of Artificial Intelligence)*; Jean Lahoud (MBZUAI); Fahad Shahbaz Khan (MBZUAI); Rao Muhammad Anwer (MBZUAI/AALTO); Hisham Cholakkal (MBZUAI)
Oral 1-2 Text, Speech, and Vision
66 Affective Embedding Framework with Semantic Representations from Tweets for Zero-shot Visual Sentiment Prediction

Yingrui Ye (Hokkaido University)*; Yuya Moroto (Hokkaido University); Keisuke Maeda (Hokkaido University); Takahiro Ogawa (Hokkaido University); Miki Haseyama (Hokkaido University)
16 SPEAKER VGG CCT: Cross-corpus Speech Emotion Recognition with Speaker Embedding and Vision

Alessandro Arezzo (University of Florence, Italy); Stefano Berretti (University of Florence, Italy)*
50 Self-Attentive CLIP Hashing for Unsupervised Cross-Modal Retrieval

Heng Yu (Nanjing University of Science and Technology); Shuyan Ding (Nanjing University of Science and Technology)*; Lunbo Li ( Nanjing University of Science and Technology); Jiexin Wu (Nanjing university of Science and Technology )
1 An End-to-End Scene Text Detector with Dynamic Attention

Jingyu Lin (厦门大学); Yan Yan (Xiamen University); Hanzi Wang (Xiamen University)*
December 15
Oral 2-1 Video Compression, Broadcasting, and Analysis
11 Human-Avatar Interaction in Metaverse: Framework for Full-body Interaction

Kit Yung Lam (Hong Kong University of Science and Technology)*; Liang Yang (Hong Kong University of Science and Technology); Ahmad ALHILAL (The Hong Kong University of Science and Technology); Lik Hang Lee (KAIST); Gareth Tyson (Hong Kong University of Science and Technology (GZ)); Pan HUI (The Hong Kong University of Science and Technology)
49 Parallel Queries for Human-Object Interaction Detection

Junwen Chen (The University of Electro-Communications)*; Keiji Yanai (Univ. Electro-Comm., Tokyo)
95 Sequential Frame-Interpolation and DCT-based Video Compression Framework

Yeganeh Jalalpour (Portland State University)*; Wu-chi Feng (Portland State University); Feng Liu (Portland State University)
25 360BroadView: Viewer Management for Viewport Prediction in 360-Degree Video Live Broadcast

Qian Zhou (University of Illinois at Urbana-Champaign)*; Zhe Yang (University of Illinois at Urbana-Champaign); Hongpeng Guo (University of Illinois at Urbana Champaign); Beitong Tian (University of Illinois at Urbana Champaign); Klara Nahrstedt (University of Illinois at Urbana-Champaign)
72 Two-Layer Learning-based P-Frame Coding with Super-Resolution and Content-Adaptive Conditional ANF

David Alexandre (National Yang Ming Chiao Tung University)*; Hsueh-Ming Hang (National Yang Ming Chiao Tung University); Wen-Hsiao Peng (National Yang Ming Chiao Tung University)
74 Learned Bi-Directional Motion Prediction for Video Compression

Yunhui Shi (Beijing University of Technology); Shaopei An (Beijing University of Technology)*; Jin Wang (Beijing University of Technology); Baocai Yin (Beijing University of Technology)
Short Spotlight
9 A Multimodal Sensor Fusion Framework Robust to Missing Modalities for Person Recognition

Vijay John (RIKEN)*; Yasutomo Kawanishi (RIKEN)
34 SLGAN: Style- and Latent-guided Generative Adversarial Network for Desirable Makeup Transfer and Removal

Daichi Horita (The University of Tokyo)*; Kiyoharu Aizawa (The University of Tokyo)
64 Popularity-aware Graph Social Recommendation for Fully Non-Interaction Users

Nozomu Onodera (Hokkaido University)*; Keisuke Maeda (Hokkaido University); Takahiro Ogawa (Hokkaido University); Miki Haseyama (Hokkaido University)
28 Multimodal Fusion with Cross-Modal Attention for Action Recognition in Still Images

Jia-Hua Tsai (Department of Computer Science and Information Engineering, National Cheng-Kung University); Wei-Ta Chu (National Cheng Kung University)*
45 Zero-shot Font Style Transfer with a Differentiable Renderer

Kota Izumi (The University of Electro-Communications); Keiji Yanai (Univ. Electro-Comm., Tokyo)*
98 Wearable Camera Based Food Logging System

Kenshiro Sato (The University of Tokyo)*; Yoko Yamakata (University of Tokyo, Japan); Sosuke Amano (foo.log Inc.); Kiyoharu Aizawa (The University of Tokyo)
90 Graph Neural Network Based Living Comfort Prediction Using Real Estate Floor Plan Images

Ryota Kitabayashi (The University of Tokyo)*; Taro Narahara (njit); Toshihiko Yamasaki (The University of Tokyo)
48 Wider or Deeper Neural Network Architecture for Acoustic Scene Classification with Mismatched Recording Devices

LAM PHAM (Austrian Institute of Technology)*; Khoa Tran (Da Nang University); Dat Ngo (University of Essex); Hieu Tang (FPT University); Son Phan (Earable AI); Alexander Schindler (Austrian Institute of Technology)
24 A Reality Check of Positioning in Multiuser Mobile Augmented Reality: Measurement and Analysis

Na Wang (George Mason University)*; Haoliang Wang (Adobe Research); Stefano Petrangeli (Adobe); Viswanathan (Vishy) Swaminathan (Adobe); Fei Li (George Mason University); Songqing Chen (George Mason University)
59 Towards High Performance One-Stage Human Pose Estimation

Ling Li (Nanjing University of Science and Technology)*; Lin Zhao (Nanjing University of Science and Technology); Linhao Xu (Nanjing University of Science and Technology); Jie Xu (Nanjing University of Science and Technology)
85 Singing Voice Detection via Similarity-based Semi-supervised Learning Method

Xi Chen (Fudan University)*
Poster+Demo
100 A Music Loop Sequencer with User-adaptive Music Loop Selection

Yuki Iwamoto (Nihon University)*; Tetsuro Kitahara (Nihon University)
105 Action Detection System based on Pose Information

Ryo Kawai (NEC Corporation)*; Noboru Yoshida (NEC Corporation); Jianquan Liu (NEC Corporation)
106 DeepHair: a DeepFake-based Hairstyle Preview System

Yu-Hsuan Lo (Taipei National University of the Arts); Shih-Wei Sun (Taipei National University of the Arts)*
107 Emotional Talking Faces: Making Videos More Expressive and Realistic

Sahil Goyal (IIT Roorkee)*; Shagun Uppal (IIIT-Delhi); Sarthak Bhagat (IIIT-Delhi); Dhroov Goel (IIITD); Sakshat Mali (Indraprastha Institute of Information Technology, Delhi); Yi Yu (NII); Yifang Yin (A*STAR); Rajiv Ratn Shah (IIIT Delhi)
109 FoodLog Athl: Multimedia Food Recording Platform for Dietary Guidance and Food Monitoring

Kei Nakamoto (The University of Tokyo)*; Kohei Kumazawa (The University of Tokyo ); Hiroaki KARASAWA (Hongo Software Development); Sosuke Amano (foo.log Inc.); Yoko Yamakata (University of Tokyo, Japan); Kiyoharu Aizawa (The University of Tokyo)
110 Rubber material retrieval system using electron microscope images for rubber material development

Rintaro Yanagi (Hokkaido University)*; Ren Togo (Hokkaido University); Takahiro Ogawa (Hokkaido University); Miki Haseyama (Hokkaido University)
104 JamSketch Deep α: A CNN-based Improvisation System in Accordance with User's Melodic Outline Drawing

Tetsuro Kitahara (Nihon University)*; Akio Yonamine (Nihon University)
108 GSTH266enc: A GStreamer plugin for VVC encoder

Advaiit Rajjvaed (InterDigital Inc.); Saurabh Puri (InterDigital Inc.)*; Gurdeep Bhullar (InterDigital Inc.); Gaelle Martin-Cocher (InterDigital Inc.)
101 Intelligent Video Surveillance Platform Based On FFmpeg And Yolov5

Chuanxu Jiang (HoHai university); Yanfang Wang (Hohai University); Qian Huang (Hohai University)*; Yiming Wang (HoHai university); Yuhan Dai (Hohai University)
9 A Multimodal Sensor Fusion Framework Robust to Missing Modalities for Person Recognition

Vijay John (RIKEN)*; Yasutomo Kawanishi (RIKEN)
34 SLGAN: Style- and Latent-guided Generative Adversarial Network for Desirable Makeup Transfer and Removal

Daichi Horita (The University of Tokyo)*; Kiyoharu Aizawa (The University of Tokyo)
64 Popularity-aware Graph Social Recommendation for Fully Non-Interaction Users

Nozomu Onodera (Hokkaido University)*; Keisuke Maeda (Hokkaido University); Takahiro Ogawa (Hokkaido University); Miki Haseyama (Hokkaido University)
28 Multimodal Fusion with Cross-Modal Attention for Action Recognition in Still Images

Jia-Hua Tsai (Department of Computer Science and Information Engineering, National Cheng-Kung University); Wei-Ta Chu (National Cheng Kung University)*
45 Zero-shot Font Style Transfer with a Differentiable Renderer

Kota Izumi (The University of Electro-Communications); Keiji Yanai (Univ. Electro-Comm., Tokyo)*
98 Wearable Camera Based Food Logging System

Kenshiro Sato (The University of Tokyo)*; Yoko Yamakata (University of Tokyo, Japan); Sosuke Amano (foo.log Inc.); Kiyoharu Aizawa (The University of Tokyo)
90 Graph Neural Network Based Living Comfort Prediction Using Real Estate Floor Plan Images

Ryota Kitabayashi (The University of Tokyo)*; Taro Narahara (njit); Toshihiko Yamasaki (The University of Tokyo)
48 Wider or Deeper Neural Network Architecture for Acoustic Scene Classification with Mismatched Recording Devices

LAM PHAM (Austrian Institute of Technology)*; Khoa Tran (Da Nang University); Dat Ngo (University of Essex); Hieu Tang (FPT University); Son Phan (Earable AI); Alexander Schindler (Austrian Institute of Technology)
24 A Reality Check of Positioning in Multiuser Mobile Augmented Reality: Measurement and Analysis

Na Wang (George Mason University)*; Haoliang Wang (Adobe Research); Stefano Petrangeli (Adobe); Viswanathan (Vishy) Swaminathan (Adobe); Fei Li (George Mason University); Songqing Chen (George Mason University)
59 Towards High Performance One-Stage Human Pose Estimation

Ling Li (Nanjing University of Science and Technology)*; Lin Zhao (Nanjing University of Science and Technology); Linhao Xu (Nanjing University of Science and Technology); Jie Xu (Nanjing University of Science and Technology)
85 Singing Voice Detection via Similarity-based Semi-supervised Learning Method

Xi Chen (Fudan University)*
11 Human-Avatar Interaction in Metaverse: Framework for Full-body Interaction

Kit Yung Lam (Hong Kong University of Science and Technology)*; Liang Yang (Hong Kong University of Science and Technology); Ahmad ALHILAL (The Hong Kong University of Science and Technology); Lik Hang Lee (KAIST); Gareth Tyson (Hong Kong University of Science and Technology (GZ)); Pan HUI (The Hong Kong University of Science and Technology)
49 Parallel Queries for Human-Object Interaction Detection

Junwen Chen (The University of Electro-Communications)*; Keiji Yanai (Univ. Electro-Comm., Tokyo)
95 Sequential Frame-Interpolation and DCT-based Video Compression Framework

Yeganeh Jalalpour (Portland State University)*; Wu-chi Feng (Portland State University); Feng Liu (Portland State University)
25 360BroadView: Viewer Management for Viewport Prediction in 360-Degree Video Live Broadcast

Qian Zhou (University of Illinois at Urbana-Champaign)*; Zhe Yang (University of Illinois at Urbana-Champaign); Hongpeng Guo (University of Illinois at Urbana Champaign); Beitong Tian (University of Illinois at Urbana Champaign); Klara Nahrstedt (University of Illinois at Urbana-Champaign)
72 Two-Layer Learning-based P-Frame Coding with Super-Resolution and Content-Adaptive Conditional ANF

David Alexandre (National Yang Ming Chiao Tung University)*; Hsueh-Ming Hang (National Yang Ming Chiao Tung University); Wen-Hsiao Peng (National Yang Ming Chiao Tung University)
74 Learned Bi-Directional Motion Prediction for Video Compression

Yunhui Shi (Beijing University of Technology); Shaopei An (Beijing University of Technology)*; Jin Wang (Beijing University of Technology); Baocai Yin (Beijing University of Technology)
55 Deep Enhancement-Object Features Fusion for Low-light Object Detection

Wan Teng Lim (Multimedia University); Kelvin Ang (Multimedia University); Yuen Peng Loh (Multimedia University)*
7 Image Compression for Machines Using Boundary-Enhanced Saliency

Yuanyuan Xu (Hohai University)*; Haolun Lan (Hohai University)
30 Deep Weighted Guided Upsampling Network for Depth of Field Image Upsampling

Lanling Zeng (Jiangsu University)*; Lianxiong Wu (Jiangsu University); Yang Yang ( Jiangsu University); Xiang-Jun Shen (Jiangsu University); Yongzhao Zhan (UJS)
57 Multispectral Image Denoising Via Structural Tensor Sparsity Promoting Model

longlu huang (Beijing University Of Technology); Na Qi (Beijing University of Technology)*; Qing Zhu (Beijing University of Technology)
53 Multi-scale Channel Transformer Network for Single Image Deraining

Yuto Namba (Yamaguchi University)*; Xian-Hua Han (Yamaguchi University)
67 Remote sensing image colorization based on Joint Stream Deep Convolutional Generative Adversarial Networks

Jingyu J Wang (Ocean University of China); Jie Nie (Ocean University of China)*; Hao Chen (Ocean University of China); Huaxin Xie (Ocean University of China); chengyu zheng (Ocean University of China); Min Ye (Ocean university of China); Zhiqiang Wei (Ocean University of China)
86 On the Robustness of 3D Object Detectors

Fatima A Albreiki (MBZUAI ); Sultan Abughazal (MBZUAI)*; Jean Lahoud (MBZUAI); Rao Anwer (MBZUAI); Hisham Cholakkal (MBZUAI); Fahad Shahbaz Khan (MBZUAI)
December 16
Oral 3-1 Low-level Vision and Image Processing
55 Deep Enhancement-Object Features Fusion for Low-light Object Detection

Wan Teng Lim (Multimedia University); Kelvin Ang (Multimedia University); Yuen Peng Loh (Multimedia University)*
7 Image Compression for Machines Using Boundary-Enhanced Saliency

Yuanyuan Xu (Hohai University)*; Haolun Lan (Hohai University)
30 Deep Weighted Guided Upsampling Network for Depth of Field Image Upsampling

Lanling Zeng (Jiangsu University)*; Lianxiong Wu (Jiangsu University); Yang Yang ( Jiangsu University); Xiang-Jun Shen (Jiangsu University); Yongzhao Zhan (UJS)
57 Multispectral Image Denoising Via Structural Tensor Sparsity Promoting Model

longlu huang (Beijing University Of Technology); Na Qi (Beijing University of Technology)*; Qing Zhu (Beijing University of Technology)
53 Multi-scale Channel Transformer Network for Single Image Deraining

Yuto Namba (Yamaguchi University)*; Xian-Hua Han (Yamaguchi University)
67 Remote sensing image colorization based on Joint Stream Deep Convolutional Generative Adversarial Networks

Jingyu J Wang (Ocean University of China); Jie Nie (Ocean University of China)*; Hao Chen (Ocean University of China); Huaxin Xie (Ocean University of China); chengyu zheng (Ocean University of China); Min Ye (Ocean university of China); Zhiqiang Wei (Ocean University of China)
Oral 3-2 Robustness, Data Augmentation and Disentangling
86 On the Robustness of 3D Object Detectors

Fatima A Albreiki (MBZUAI ); Sultan Abughazal (MBZUAI)*; Jean Lahoud (MBZUAI); Rao Anwer (MBZUAI); Hisham Cholakkal (MBZUAI); Fahad Shahbaz Khan (MBZUAI)
6 Robust Learning with Adversarial Perturbations and Label Noise: A Two-Pronged Defense Approach

Peng-Fei Zhang (University of Queensland)*; Zi Helen Huang (University of Queensland); Xin Luo (Shandong University); Pengfei Zhao (Shandong University)
70 Enhancing the Robustness of Deep Learning Based Fingerprinting to Improve Deepfake Attribution

Chieh-Yin Liao (National Taiwan University)*; Chen-hsiu Huang (National Taiwan University); Jun-Cheng Chen (Academia Sinica); Ja-Ling Wu (NTU)
65 Disentangled Image Attribute Editing in Latent Space via Mask-based Retention Loss

Shunya Ohaga (Hokkaido University)*; Ren Togo (Hokkaido University); Takahiro Ogawa (Hokkaido University); Miki Haseyama (Hokkaido University)
36 ObjectMix: Data Augmentation by Copy-Pasting Objects in Videos for Action Recognition

Jun Kimata (Nagoya Institute of Technology); Tomoya Nitta (Nagoya Insitute of Technology); Toru Tamaki (Nagoya Institute of Technology)*
20 CMR3D: Contextualized Multi-Stage Refinement for 3D Object Detection

Dhanalaxmi Gaddam (Mohammed Bin Zayed University of Artificial Intelligence)*; Jean Lahoud (MBZUAI); Fahad Shahbaz Khan (MBZUAI); Rao Muhammad Anwer (MBZUAI/AALTO); Hisham Cholakkal (MBZUAI)