Publications

Publications by category in reverse chronological order. Generated by jekyll-scholar.

2024

  1. ToddlerDiffusion: Flash Interpretable Controllable Diffusion Model
    Eslam Mohamed BAKR, Liangbing Zhao, Vincent Tao Hu, Matthieu Cord, Patrick Perez, and Mohamed Elhoseiny
    In Submission, 2024

2023

  1. Generative Data Augmentation Improves Scribble-supervised Semantic Segmentation
    Jacob Schnell, Jieke Wang, Lu Qi, Vincent Tao Hu, and Meng Tang
    In arXiv, 2023
    Explore diffusion models for data augmentation in segmentation tasks.
  2. On the Few-Shot Generalization of Learning on Implicit Neural Representations
    Tao Hu, David W Zhang, Yunlu Chen, Teng Long, Yuki Asano, Pascal Mettes, Efstratios Gavves, Basura Fernando, and Cees G.M. Snoek
    In ICCV NeRF4ADR Workshop, 2023
    Explore few-shot generalization of INR on images.
  3. Query by Activity Video in the Wild
    In ICIP, 2023
    Few-shot video retrieval.
  4. Flow Matching for Conditional Text Generation in a Single Sampling Step
    In Submission, 2023
    Flow matching for text generation.
  5. Latent Space Editing in Transformer-based Flow Matching
    In ICML 2023 Workshop, New Frontiers in Learning, Control, and Dynamical Systems, 2023
  6. Self-Guided Diffusion Models
    In CVPR, 2023
    A bridge between the community of self-supervised learning and diffusion models. Short version to appear in NeurIPS 2022 Workshop on Score-Based Methods and NeurIPS 2022 Workshop Self-Supervised Learning Theory and Practice.

2021

  1. Self-supervised Video Representation Learning with Cross-Stream Prototypical Contrasting
    Martine Toering, Ioannis Gatopoulos, Maarten Stol, and Tao Hu
    In WACV, 2021
    Improve video representations by contrasting prototypical features.

2020

  1. Localizing the Common Action Among a Few Videos
    Pengwan Yang*, Tao Hu*, Pascal Mettes, and Cees G.M. Snoek
    In European Conference on Computer Vision (ECCV), 2020
    Localize the temporal extent of an action in a long untrimmed video using attention techniques.
  2. PointMixup: Augmentation for Point Clouds
    Yunlu Chen*, Tao Hu*, Efstratios Gavves, Thomas Mensink, Pascal Mettes, Pengwan Yang, and Cees G.M. Snoek
    In European Conference on Computer Vision (ECCV), 2020
    A simple augmentation method based on MixUp that boosts performance on point cloud tasks.
  3. Interactivity proposals for surveillance videos
    In International Conference on Multimedia Retrieval (ICMR), 2020

2019

  1. Attention-based Multi-Context Guiding for Few-Shot Semantic Segmentation
    Tao Hu, Pengwan Yang, Chiliang Zhang, Gang Yu, Yadong Mu, and Cees G.M. Snoek
    In AAAI, 2019
    Solve the few-shot segmentation problem by applying attention at multiple scales.
  2. SILCO: Show a Few Images, Localize the Common Object
    Tao Hu, Pascal Mettes, Jia-Hong Huang, and Cees G.M. Snoek
    In International Conference on Computer Vision (ICCV), 2019
    Design a graph network and apply attention to it to solve the problem of common object localization.

2018

  1. Dense In Dense: Training Segmentation from Scratch
    In Asian Conference on Computer Vision (ACCV), 2018
  2. Sobel heuristic kernel for aerial semantic segmentation
    Tao Hu, Yao Wang, Yisong Chen, Peng Lu, and Heng Wang
    In IEEE International Conference on Image Processing (ICIP), 2018
  3. Accelerating convolutional neural networks with dynamic channel pruning
    Chiliang Zhang, Tao Hu, Yingda Guan, and Zuochang Ye
    In Data Compression Conference (DCC), 2018