CVMar 24, 2023
DiffuScene: Denoising Diffusion Models for Generative Indoor Scene SynthesisJiapeng Tang, Yinyu Nie, Lev Markhasin et al.
We present DiffuScene for indoor 3D scene synthesis based on a novel scene configuration denoising diffusion model. It generates 3D instance properties stored in an unordered object set and retrieves the most similar geometry for each object configuration, which is characterized as a concatenation of different attributes, including location, size, orientation, semantics, and geometry features. We introduce a diffusion network to synthesize a collection of 3D indoor objects by denoising a set of unordered object attributes. Unordered parametrization simplifies and eases the joint distribution approximation. The shape feature diffusion facilitates natural object placements, including symmetries. Our method enables many downstream applications, including scene completion, scene arrangement, and text-conditioned scene synthesis. Experiments on the 3D-FRONT dataset show that our method can synthesize more physically plausible and diverse indoor scenes than state-of-the-art methods. Extensive ablation studies verify the effectiveness of our design choice in scene diffusion models.
CVOct 11, 2022
Neural Shape Deformation PriorsJiapeng Tang, Lev Markhasin, Bi Wang et al.
We present Neural Shape Deformation Priors, a novel method for shape manipulation that predicts mesh deformations of non-rigid objects from user-provided handle movements. State-of-the-art methods cast this problem as an optimization task, where the input source mesh is iteratively deformed to minimize an objective function according to hand-crafted regularizers such as ARAP. In this work, we learn the deformation behavior based on the underlying geometric properties of a shape, while leveraging a large-scale dataset containing a diverse set of non-rigid deformations. Specifically, given a source mesh and desired target locations of handles that describe the partial surface deformation, we predict a continuous deformation field that is defined in 3D space to describe the space deformation. To this end, we introduce transformer-based deformation networks that represent a shape deformation as a composition of local surface deformations. It learns a set of local latent codes anchored in 3D space, from which we can learn a set of continuous deformation functions for local surfaces. Our method can be applied to challenging deformations and generalizes well to unseen deformations. We validate our approach in experiments using the DeformingThing4D dataset, and compare to both classic optimization-based and recent neural network-based methods.
NTApr 15, 2016
Optimal $L_p$-discrepancy bounds for second order digital sequencesJosef Dick, Aicke Hinrichs, Lev Markhasin et al.
The $L_p$-discrepancy is a quantitative measure for the irregularity of distribution modulo one of infinite sequences. In 1986 Proinov proved for all $p>1$ a lower bound for the $L_p$-discrepancy of general infinite sequences in the $d$-dimensional unit cube, but it remained an open question whether this lower bound is best possible in the order of magnitude until recently. In 2014 Dick and Pillichshammer gave a first construction of an infinite sequence whose order of $L_2$-discrepancy matches the lower bound of Proinov. Here we give a complete solution to this problem for all finite $p > 1$. We consider so-called order $2$ digital $(t,d)$-sequences over the finite field with two elements and show that such sequences achieve the optimal order of $L_p$-discrepancy simultaneously for all $p \in (1,\infty)$.
NASep 21, 2011
On lower bounds for the L_2-discrepancyAicke Hinrichs, Lev Markhasin
The L_2-discrepancy measures the irregularity of the distribution of a finite point set. In this note we prove lower bounds for the L_2 discrepancy of arbitrary N-point sets. Our main focus is on the two-dimensional case. Asymptotic upper and lower estimates of the L_2-discrepancy in dimension 2 are well-known and are of the sharp order sqrt(log N). Nevertheless the gap in the constants between the best known lower and upper bounds is unsatisfactory large for a two-dimensional problem. Our lower bound improves upon this situation considerably. The main method is an adaption of the method of K. F. Roth using the Fourier coefficients of the discrepancy function with respect to the Haar basis.
NAOct 31, 2012
Discrepancy of generalized Hammersley type point sets in Besov spaces of dominating mixed smoothnessLev Markhasin
The symmetrized Hammersley point set is known to achieve the best possible rate for the $L_2$-norm of the discrepancy function. Also lower bounds for the norm in Besov spaces of dominating mixed smoothness are known. In this paper a large class of point sets which are generalizations of the Hammersley type point sets are proved to asymptotically achieve the known lower bound of the Besov norm. The proof uses a $b$-adic generalization of the Haar system. This result can be regarded as a preparation for the proof in arbitrary dimension.
NAOct 29, 2012
Quasi-Monte Carlo methods for integration of functions with dominating mixed smoothness in arbitrary dimensionLev Markhasin
In a celebrated construction, Chen and Skriganov gave explicit examples of point sets achieving the best possible $L_2$-norm of the discrepancy function. We consider the discrepancy function of the Chen-Skriganov point sets in Besov spaces with dominating mixed smoothness and show that they also achieve the best possible rate in this setting. The proof uses a $b$-adic generalization of the Haar system and corresponding characterizations of the Besov space norm. Results for further function spaces and integration errors are concluded.
CVJul 12, 2025
EgoAnimate: Generating Human Animations from Egocentric top-down ViewsG. Kutay Türkoglu, Julian Tanke, Iheb Belgacem et al.
An ideal digital telepresence experience requires accurate replication of a person's body, clothing, and movements. To capture and transfer these movements into virtual reality, the egocentric (first-person) perspective can be adopted, which enables the use of a portable and cost-effective device without front-view cameras. However, this viewpoint introduces challenges such as occlusions and distorted body proportions. There are few works reconstructing human appearance from egocentric views, and none use a generative prior-based approach. Some methods create avatars from a single egocentric image during inference, but still rely on multi-view datasets during training. To our knowledge, this is the first study using a generative backbone to reconstruct animatable avatars from egocentric inputs. Based on Stable Diffusion, our method reduces training burden and improves generalizability. Inspired by methods such as SiTH and MagicMan, which perform 360-degree reconstruction from a frontal image, we introduce a pipeline that generates realistic frontal views from occluded top-down images using ControlNet and a Stable Diffusion backbone. Our goal is to convert a single top-down egocentric image into a realistic frontal representation and feed it into an image-to-motion model. This enables generation of avatar motions from minimal input, paving the way for more accessible and generalizable telepresence systems.
CVDec 16, 2021
TAFIM: Targeted Adversarial Attacks against Facial Image ManipulationsShivangi Aneja, Lev Markhasin, Matthias Niessner
Face manipulation methods can be misused to affect an individual's privacy or to spread disinformation. To this end, we introduce a novel data-driven approach that produces image-specific perturbations which are embedded in the original images. The key idea is that these protected images prevent face manipulation by causing the manipulation model to produce a predefined manipulation target (uniformly colored output image in our case) instead of the actual manipulation. In addition, we propose to leverage differentiable compression approximation, hence making generated perturbations robust to common image compression. In order to prevent against multiple manipulation methods simultaneously, we further propose a novel attention-based fusion of manipulation-specific perturbations. Compared to traditional adversarial attacks that optimize noise patterns for each image individually, our generalized model only needs a single forward pass, thus running orders of magnitude faster and allowing for easy integration in image processing stacks, even on resource-constrained devices like smartphones.