Gerhard P. Hancke

h-index51

4papers

92citations

Novelty43%

AI Score38

Ranked #84,789 of 194,257 authors (top 44%)#28,584 in CV (top 48%)

4 Papers

5.9CVAug 6, 2023

Language-based Photo Color Adjustment for Graphic Designs

Zhenwei Wang, Nanxuan Zhao, Gerhard Hancke et al.

Adjusting the photo color to associate with some design elements is an essential way for a graphic design to effectively deliver its message and make it aesthetically pleasing. However, existing tools and previous works face a dilemma between the ease of use and level of expressiveness. To this end, we introduce an interactive language-based approach for photo recoloring, which provides an intuitive system that can assist both experts and novices on graphic design. Given a graphic design containing a photo that needs to be recolored, our model can predict the source colors and the target regions, and then recolor the target regions with the source colors based on the given language-based instruction. The multi-granularity of the instruction allows diverse user intentions. The proposed novel task faces several unique challenges, including: 1) color accuracy for recoloring with exactly the same color from the target design element as specified by the user; 2) multi-granularity instructions for parsing instructions correctly to generate a specific result or multiple plausible ones; and 3) locality for recoloring in semantically meaningful local regions to preserve original image semantics. To address these challenges, we propose a model called LangRecol with two main components: the language-based source color prediction module and the semantic-palette-based photo recoloring module. We also introduce an approach for generating a synthetic graphic design dataset with instructions to enable model training. We evaluate our model via extensive experiments and user studies. We also discuss several practical applications, showing the effectiveness and practicality of our approach. Code and data for this paper are at: https://zhenwwang.github.io/langrecol.

7.7SYApr 12

A Review of Hydrogen-Enabled Resilience Enhancement for Multi-Energy Systems

Liang Yu, Haoyu Fang, Goran Strbac et al.

Ensuring resilience in multi-energy systems (MESs) has become increasingly urgent and challenging due to the growing frequency and severity of extreme events, such as natural disasters, extreme weather, and cyber-physical attacks. Among the various approaches to enhancing MES resilience, hydrogen integration offers significant potential in cross-temporal, cross-spatial, and cross-sector flexibility, as well as black-start capability. Although considerable efforts have been devoted to this area, a systematic review of resilience enhancement in hydrogen-enabled MESs is still lacking. To address this gap, this paper presents a comprehensive review of hydrogen-enabled MES resilience enhancement. First, advantages, vulnerabilities, and challenges related to hydrogen-enabled MES resilience enhancement are summarized. Next, a resilience enhancement framework for hydrogen-enabled MESs is proposed, based on which existing resilience metrics and event-oriented contingency models are reviewed and discussed. Planning measures are then classified according to the types of hydrogen-related facilities, together with uncertainty handling methods, scenario generation methods, and planning problem formulation frameworks. In addition, operational enhancement measures are categorized into three response stages: prevention, emergency response, and restoration. Finally, research gaps are identified and future directions are discussed, including comprehensive resilience metric design, advanced extreme-event scenario generation, spatiotemporal cyber-physical contingency modeling under compound extreme events, coordinated planning and operation across multiple networks and timescales, low-carbon resilient planning and operation, and large language model-assisted whole-process resilience enhancement.

15.8CVSep 17, 2024

Phidias: A Generative Model for Creating 3D Content from Text, Image, and 3D Conditions with Reference-Augmented Diffusion

Zhenwei Wang, Tengfei Wang, Zexin He et al.

In 3D modeling, designers often use an existing 3D model as a reference to create new ones. This practice has inspired the development of Phidias, a novel generative model that uses diffusion for reference-augmented 3D generation. Given an image, our method leverages a retrieved or user-provided 3D reference model to guide the generation process, thereby enhancing the generation quality, generalization ability, and controllability. Our model integrates three key components: 1) meta-ControlNet that dynamically modulates the conditioning strength, 2) dynamic reference routing that mitigates misalignment between the input image and 3D reference, and 3) self-reference augmentations that enable self-supervised training with a progressive curriculum. Collectively, these designs result in a clear improvement over existing methods. Phidias establishes a unified framework for 3D generation using text, image, and 3D conditions with versatile applications.

4.6CVSep 27, 2018

Deformable Object Tracking with Gated Fusion

Wenxi Liu, Yibing Song, Dengsheng Chen et al.

The tracking-by-detection framework receives growing attentions through the integration with the Convolutional Neural Networks (CNNs). Existing tracking-by-detection based methods, however, fail to track objects with severe appearance variations. This is because the traditional convolutional operation is performed on fixed grids, and thus may not be able to find the correct response while the object is changing pose or under varying environmental conditions. In this paper, we propose a deformable convolution layer to enrich the target appearance representations in the tracking-by-detection framework. We aim to capture the target appearance variations via deformable convolution, which adaptively enhances its original features. In addition, we also propose a gated fusion scheme to control how the variations captured by the deformable convolution affect the original appearance. The enriched feature representation through deformable convolution facilitates the discrimination of the CNN classifier on the target object and background. Extensive experiments on the standard benchmarks show that the proposed tracker performs favorably against state-of-the-art methods.