|
| Sequential Voting with Relational Box Fields for Active Object Detection |
|
| Secure & Personalized Music-to-Video Generation via CHARCHA |
|
|
| Depth-supervised NeRF: Fewer Views and Faster Training for Free |
|
| 3D reconstruction with fast dipole sums |
|
| Bottom Up Top Down Detection Transformers for Language Grounding in Images and Point Clouds |
|
| Differentiable Raycasting for Self-supervised Occupancy Forecasting |
|
| Ensembling Off-the-shelf Models for GAN Training |
|
| Swept-Angle Synthetic Wavelength Interferometry |
|
| Multi-Concept Customization of Text-to-Image Diffusion |
|
| Human-to-Robot Imitation in the Wild |
|
| RAC: Reconstructing Animatable Categories from Videos |
|
| RelPose: Predicting Probabilistic Relative Rotation for Single Objects in the Wild |
|
| Tactile DreamFusion: Exploiting Tactile Sensing for 3D Generation |
|
| Dataset Distillation by Matching Training Trajectories |
|
| Megahertz Light Steering without Moving Parts |
|
| PPR: Physically Plausible Reconstruction from Monocular Videos |
|
| Neural Implicit Surface Reconstruction using Imaging Sonar |
|
| Evaluating Data Attribution for Text-to-Image Models |
|
| Objects as volumes: A stochastic geometry view of opaque solids |
|
| Ablating Concepts in Text-to-Image Diffusion Models |
|
| Dual-Shutter Optical Vibration Sensing |
|
| 3D-aware Conditional Image Synthesis |
|
| Fuzzy Metaballs: Approximate Differentiable Rendering with Algebraic Surfaces |
|
| Efficient Spatially Sparse Inference for Conditional GANs and Diffusion Models |