Yonglin Li

CV
3papers
69citations
Novelty32%
AI Score21

3 Papers

CVNov 24, 2022
1st Workshop on Maritime Computer Vision (MaCVi) 2023: Challenge Results

Benjamin Kiefer, Matej Kristan, Janez Perš et al.

The 1$^{\text{st}}$ Workshop on Maritime Computer Vision (MaCVi) 2023 focused on maritime computer vision for Unmanned Aerial Vehicles (UAV) and Unmanned Surface Vehicle (USV), and organized several subchallenges in this domain: (i) UAV-based Maritime Object Detection, (ii) UAV-based Maritime Object Tracking, (iii) USV-based Maritime Obstacle Segmentation and (iv) USV-based Maritime Obstacle Detection. The subchallenges were based on the SeaDronesSee and MODS benchmarks. This report summarizes the main findings of the individual subchallenges and introduces a new benchmark, called SeaDronesSee Object Detection v2, which extends the previous benchmark by including more classes and footage. We provide statistical and qualitative analyses, and assess trends in the best-performing methodologies of over 130 submissions. The methods are summarized in the appendix. The datasets, evaluation code and the leaderboard are publicly available at https://seadronessee.cs.uni-tuebingen.de/macvi.

NAJun 25, 2018
FEM and CIP-FEM for Helmholtz Equation with High Wave Number and PML truncation

Yonglin Li, Haijun Wu

The Helmholtz scattering problem with high wave number is truncated by the perfectly matched layer (PML) technique and then discretized by the linear continuous interior penalty finite element method (CIP-FEM). It is proved that the truncated PML problem satisfies the inf--sup condition with inf--sup constant of order $O(k^{-1})$. Stability and convergence of the truncated PML problem are discussed. In particular, the convergence rate is twice of the previous result. The preasymptotic error estimates in the energy norm of the linear CIP-FEM as well as FEM are proved to be $C_1kh+C_2k^3h^2$ under the mesh condition that $k^3h^2$ is sufficiently small. Numerical tests are provided to illustrate the preasymptotic error estimates and show that the penalty parameter in the CIP-FEM may be tuned to reduce greatly the pollution error.

CVJul 3, 2023
RefSAM: Efficiently Adapting Segmenting Anything Model for Referring Video Object Segmentation

Yonglin Li, Jing Zhang, Xiao Teng et al.

The Segment Anything Model (SAM) has gained significant attention for its impressive performance in image segmentation. However, it lacks proficiency in referring video object segmentation (RVOS) due to the need for precise user-interactive prompts and a limited understanding of different modalities, such as language and vision. This paper presents the RefSAM model, which explores the potential of SAM for RVOS by incorporating multi-view information from diverse modalities and successive frames at different timestamps in an online manner. Our proposed approach adapts the original SAM model to enhance cross-modality learning by employing a lightweight Cross-Modal MLP that projects the text embedding of the referring expression into sparse and dense embeddings, serving as user-interactive prompts. Additionally, we have introduced the hierarchical dense attention module to fuse hierarchical visual semantic information with sparse embeddings to obtain fine-grained dense embeddings, and an implicit tracking module to generate a tracking token and provide historical information for the mask decoder. Furthermore, we employ a parameter-efficient tuning strategy to align and fuse the language and vision features effectively. Through comprehensive ablation studies, we demonstrate our model's practical and effective design choices. Extensive experiments conducted on Refer-Youtube-VOS, Ref-DAVIS17, and three referring image segmentation datasets validate the superiority and effectiveness of our RefSAM model over existing methods.