Zhe Shen

RO
h-index117
6papers
5,262citations
Novelty46%
AI Score38

6 Papers

CLJul 31, 2024
Gemma 2: Improving Open Language Models at a Practical Size

Gemma Team, Morgane Riviere, Shreya Pathak et al. · deepmind

In this work, we introduce Gemma 2, a new addition to the Gemma family of lightweight, state-of-the-art open models, ranging in scale from 2 billion to 27 billion parameters. In this new version, we apply several known technical modifications to the Transformer architecture, such as interleaving local-global attentions (Beltagy et al., 2020a) and group-query attention (Ainslie et al., 2023). We also train the 2B and 9B models with knowledge distillation (Hinton et al., 2015) instead of next token prediction. The resulting models deliver the best performance for their size, and even offer competitive alternatives to models that are 2-3 times bigger. We release all our models to the community.

CLJul 7, 2025
Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities

Gheorghe Comanici, Eric Bieber, Mike Schaekermann et al. · amazon-science, baidu

In this report, we introduce the Gemini 2.X model family: Gemini 2.5 Pro and Gemini 2.5 Flash, as well as our earlier Gemini 2.0 Flash and Flash-Lite models. Gemini 2.5 Pro is our most capable model yet, achieving SoTA performance on frontier coding and reasoning benchmarks. In addition to its incredible coding and reasoning skills, Gemini 2.5 Pro is a thinking model that excels at multimodal understanding and it is now able to process up to 3 hours of video content. Its unique combination of long context, multimodal and reasoning capabilities can be combined to unlock new agentic workflows. Gemini 2.5 Flash provides excellent reasoning abilities at a fraction of the compute and latency requirements and Gemini 2.0 Flash and Flash-Lite provide high performance at low latency and cost. Taken together, the Gemini 2.X model generation spans the full Pareto frontier of model capability vs cost, allowing users to explore the boundaries of what is possible with complex agentic problem solving.

ROFeb 7, 2022
Feedback Linearization Based Tracking Control of A Tilt-rotor with Cat-trot Gait Plan

Zhe Shen, Yudong Ma, Takeshi Tsuchiya

With the introduction of the laterally bounded forces, the tilt-rotor gains more flexibility in the controller design. Typical feedback linearization methods utilize all the inputs in controlling this vehicle; the magnitudes as well as the directions of the thrusts are maneuvered simultaneously based on a unified control rule. Although several promising results indicate that these controllers may track the desired complicated trajectories, the tilting angles are required to change relatively fast or in large scale during the flight, which turns to be a challenge in application. The recent gait plan for a tilt-rotor may solve this problem; the tilting angles are fixed or vary in a predetermined pattern without being maneuvered by the control algorithm. Carefully avoiding the singular decoupling matrix, several attitudes can be tracked without changing the tilting angles frequently. While the position was not directly regulated in that research, which left the position-tracking still an open question. In this research, we elucidate the coupling relationship between the position and the attitude. Based on this, we design the position-tracking controller, adopting feedback linearization. A cat-trot gait is further designed for a tilt-rotor to track the reference; three types of references are designed for our tracking experiments: setpoint, uniform rectilinear motion, and uniform circular motion. The significant improvement with less steady state error is witnessed after equipping with our modified attitude-position decoupler. It is also found that the frequency of the cat-trot gait highly influenced the steady state error.

ROJan 5, 2022
Gait Analysis for A Tilt-rotor: The Dynamic Invertible Gait

Zhe Shen, Takeshi Tsuchiya

Conventional Feedback-Linearization-based controller, applied to the tilt-rotor (eight inputs), results in the extensive changes in the tilting angles, which are not expected in practice. To solve this problem, we introduce the novel concept UAV gait to restrict the tilting angles. The gait plan was initially to solve the control problems for quadruped (four-legged) robots. Transplanting this approach, accompanied by feedback linearization, to the tiltrotor may cause the well-known non-invertible problem in the decoupling matrix. In this research, we explore the invertible gait for the tiltrotor and apply feedback linearization to stabilize the attitude and the altitude. The equivalent conditions to achieve a full-rank decoupling matrix are deduced and simplified to a near zero roll and zero pitch. This paper proposed several invertible gaits to conduct the attitude-altitude control test. The accepted gaits within the region of interest are visualized. The experiment is simulated in Simulink, MATLAB. The results show the promising response in attitude and altitude.

ROAug 19, 2021
Quad-cone-rotor: A Novel Tilt Quadrotor with Severe-fault-tolerant Ability

Zhe Shen, Yudong Ma, Takeshi Tsuchiya

Conventional quadrotors received great attention in trajectory design and fault-tolerant control in these years. The direction of each thrust is perpendicular to the body because of the geometrics in mechanical design. Comparing with the conventional quadrotor, a novel quadrotor named quad-tilt-rotor brings better freedom in manipulating the thrust vector. Quad-tilt-rotor augments the additional degrees of freedom in the thrust, providing the possibility of violating the normal direction of the thrust in the conventional quadrotor. This provides the ability of greater agility in control. This paper presents a novel design of a quad-tilt-rotor (quad-cone-rotor) whose thrust can be assigned along the edge of a cone shape. Besides the inheriting merits in agile from quad-tilt-rotor, the quad-cone-rotor is expected to take fault-tolerant control in severe dynamic failure (total loss in all thrusts). We simulate the control result in a UAV simulator in SIMULINK, MATLAB.

CVApr 9, 2020
Identification of splicing edges in tampered image based on Dichromatic Reflection Model

Zhe Shen, Peng Sun, Yubo Lang et al.

Imaging is a sophisticated process combining a plenty of photovoltaic conversions, which lead to some spectral signatures beyond visual perception in the final images. Any manipulation against an original image will destroy these signatures and inevitably leave some traces in the final forgery. Therefore we present a novel optic-physical method to discriminate splicing edges from natural edges in a tampered image. First, we transform the forensic image from RGB into color space of S and o1o2. Then on the assumption of Dichromatic Reflection Model, edges in the image are discovered by composite gradient and classified into different types based on their different photometric properties. Finally, splicing edges are reserved against natural ones by a simple logical algorithm. Experiment results show the efficacy of the proposed method.