Interested in building controllable & reliable machine vision systems? Some questions we’ll explore together:
🔧 How can we guide models to produce outputs that follow constraints?
🔎 How can we make models easier to interpret and explain?
📊 How can we measure and reduce uncertainty in predictions?
If this fits you, please email your CV (with publication list, transcripts, and SOP) to
hle40@charlotte.edu with the subject line
[PhDApplicant-YourName].
News
[2025] Iterative meshing of UDFs has been accepted to NeurIPS 2025.
[2025] Two papers have been accepted to ICCV 2025, both are selected for Oral presentation (64 out of 11,239 submissions).
[2025] Pair-wise implicit constraint has been early accepted (top 9%) to MICCAI 2025.
[2025] Qt-DoG 🐶 has been accepted to ICML 2025.
[2025] A paper on personalized scanpath prediction has been accepted to CVPR 2025.
[2025] A paper on identifying reliable seeds for diffusion models has been accepted to ICLR 2025 as a Spotlight paper.
TL;DR: A novel token-merging method speeds up diffusion models by preserving important tokens,
enhancing sample quality with minimal computational cost.
Counting Stacked Objects
Corentin Dumery, Noa Ette, Aoxiang Fan, Ren Li, Jingyi Xu, Hieu Le, Pascal Fua
ICCV 2025.
[Preprint]
TL;DR: We do 3D volume reconstruction from multi-view images to count hidden objects
Gradient Distance Function
Hieu Le, Federico Stella, Benoit Guillard, and Pascal Fua
ICCV - Wild3D Workshop 2025.
[Preprint]
Pairwise-Constrained Implicit Functions for 3D Human Heart Modeling
Hieu Le, Jingyi Xu, Nicolas Talabot, Jiancheng Yang, Pascal Fua.
MICCAI 2025.
[Preprint]
We use monte a carlo sampling method for finding relevant points to refine pairs of SDF surfaces, making them contact each other without penetrating.
QT-DoG: Quantization-Aware Training for Domain Generalization
TL;DR: QT-DoG uses weight quantization as a regularizer to encourage flatter minima, enhancing
domain generalization. We also introduce ensembles of quantized models, achieving SoTA performance in DG.
TL;DR: Noises are important for diffusion-based models. We show that some random seeds are much more reliable than others, which can be used to generate useful training data.
Learning to Count from Pseudo-Labeled Segmentation
Jingyi Xu, Hieu Le, Dimitris Samaras
WACV 2025.
[Preprint]
Existing methods suffer from the counting-everything issues. We introduce a benchmark with multiple
countable objects in each image and show that we can mitigate this issues by using synthetic data.
Shadow Removal Refinement via Material-Consistent Shadow Edges
We proposed a new shadow removal method leveraging supervision from the shadow edges. Plus, we
introduced a new benchmark for shadow removal with no shadow-free images needed!
Assessing Sample Quality via the Latent Space of Generative Models
Latent points located in dense regions of the latent manifold tend to produce higher-quality
samples, while those in sparser regions yield lower-quality ones.
Controlling the Fidelity and Diversity of Deep Generative Models via Pseudo
Density
Learning Frame-Wise Emotion Intensity for Audio-Driven Talking-Head Generation
Jingyi Xu, Hieu Le, Zhixin Shu, Yang Wang, Yi-Hsuan Tsai,
Dimitris Samaras, 2025 [Preprint]
We introduce a novel method for generating talking-head videos driven by audio input, incorporating
frame-wise emotion intensity to enhance realism and expressiveness in visual output. The intensity
is
pseudo-labeled and we introduce a latent space that facilitates generating videos with varied
frame-wise intensities.
We introduces a novel approach to referring expression segmentation with instance-awareness,
automatically finding and linking object instances in the image with the textual entities describing
them.
MedTet: An Online Motion Model for 4D Heart Reconstruction
TL;DR: This paper introduces a versatile framework for reconstructing 3D cardiac motion from limited real-time data, such as 2D slices or even 1D signals. Using a deformable tetrahedral grid, the method ensures anatomically consistent reconstructions suitable for intraoperative scenarios.
Services
Journal Reviewer:
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)
International Journal of Computer Vision (IJCV)
IEEE Transactions on Image Processing (TIP)
Computer Vision and Image Understanding (CVIU)
Journal of Photogrammetry and Remote Sensing (ISPRS)
Conference Reviewer:
IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
International Conference on Computer Vision (ICCV)
European Conference on Computer Vision (ECCV)
AAAI Conference on Artificial Intelligence (AAAI)
Asian Conference on Computer Vision (ACCV)
The International Conference on Learning Representations (ICLR)
International Conference on Machine Learning (ICML)
Conference on Neural Information Processing Systems
(NeurISP)