Computer Vision @ Carnegie Mellon
Sequential Voting with Relational Box Fields for Active Object Detection
Secure & Personalized Music-to-Video Generation via CHARCHA
Depth-supervised NeRF: Fewer Views and Faster Training for Free
3D reconstruction with fast dipole sums
Bottom Up Top Down Detection Transformers for Language Grounding in Images and Point Clouds
Differentiable Raycasting for Self-supervised Occupancy Forecasting
Ensembling Off-the-shelf Models for GAN Training
Swept-Angle Synthetic Wavelength Interferometry
Multi-Concept Customization of Text-to-Image Diffusion
Human-to-Robot Imitation in the Wild
RAC: Reconstructing Animatable Categories from Videos
RelPose: Predicting Probabilistic Relative Rotation for Single Objects in the Wild
Tactile DreamFusion: Exploiting Tactile Sensing for 3D Generation
Dataset Distillation by Matching Training Trajectories
Megahertz Light Steering without Moving Parts
PPR: Physically Plausible Reconstruction from Monocular Videos
Neural Implicit Surface Reconstruction using Imaging Sonar
Evaluating Data Attribution for Text-to-Image Models
Objects as volumes: A stochastic geometry view of opaque solids
Ablating Concepts in Text-to-Image Diffusion Models
Dual-Shutter Optical Vibration Sensing
3D-aware Conditional Image Synthesis
Fuzzy Metaballs: Approximate Differentiable Rendering with Algebraic Surfaces
Efficient Spatially Sparse Inference for Conditional GANs and Diffusion Models

Carnegie Mellon University 5000 Forbes Ave Pittsburgh, PA 15213