CV Dec 16, 2024
Can video generation replace cinematographers? Research on the cinematic language of generated video
Xiaozhe Li, Kai WU, Siyi Yang et al.
Recent advancements in text-to-video (T2V) generation have leveraged diffusion models to enhance visual coherence in videos synthesized from textual descriptions. However, existing research primarily focuses on object motion, often overlooking cinematic language, which is crucial for conveying emotion and narrative pacing in cinematography. To address this, we propose a threefold approach to improve cinematic control in T2V models. First, we introduce a meticulously annotated cinematic language dataset with twenty subcategories, covering shot framing, shot angles, and camera movements, enabling models to learn diverse cinematic styles. Second, we present CameraDiff, which employs LoRA for precise and stable cinematic control, ensuring flexible shot generation. Third, we propose CameraCLIP, designed to evaluate cinematic alignment and guide multi-shot composition. Building on CameraCLIP, we introduce CLIPLoRA, a CLIP-guided dynamic LoRA composition method that adaptively fuses multiple pre-trained cinematic LoRAs, enabling smooth transitions and seamless style blending. Experimental results demonstrate that CameraDiff ensures stable and precise cinematic control, CameraCLIP achieves an R@1 score of 0.83, and CLIPLoRA significantly enhances multi-shot composition within a single video, bridging the gap between automated video generation and professional cinematography.
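The R@1 score reported above is a retrieval metric. The abstract does not say how CameraCLIP computes it, so the following is only a minimal sketch of the standard recall-at-1 definition, assuming a square similarity matrix in which video i is paired with caption i. The function name and the toy scores are illustrative, not taken from the paper.

```python
import numpy as np

def recall_at_1(similarity: np.ndarray) -> float:
    """Fraction of rows whose highest-scoring column is the paired one.

    Assumption: similarity[i, j] scores video i against caption j,
    and the ground-truth caption for video i is caption i.
    """
    top1 = similarity.argmax(axis=1)            # best caption per video
    truth = np.arange(similarity.shape[0])      # paired caption indices
    return float((top1 == truth).mean())

# Toy scores (made up): videos 0 and 1 retrieve their own caption, video 2 does not.
sim = np.array([
    [0.9, 0.1, 0.2],
    [0.3, 0.8, 0.4],
    [0.6, 0.2, 0.5],
])
print(recall_at_1(sim))
```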
CV Jul 11, 2021
A Cloud-Edge-Terminal Collaborative System for Temperature Measurement in COVID-19 Prevention
Zheyi Ma, Hao Li, Wen Fang et al.
To prevent the spread of coronavirus disease 2019 (COVID-19), preliminary temperature measurement and mask detection are conducted in public areas. However, existing temperature measurement methods face safety and deployment problems. In this paper, to realize safe and accurate temperature measurement even when a person's face is partially obscured, we propose a cloud-edge-terminal collaborative system with a lightweight infrared temperature measurement model. A binocular camera with an RGB lens and a thermal lens is utilized to simultaneously capture image pairs. Then, a mobile detection model based on a multi-task cascaded convolutional network (MTCNN) is proposed to realize face alignment and mask detection on the RGB images. For accurate temperature measurement, we map the facial landmarks from the RGB images to the thermal images by an affine transformation and select a more accurate temperature measurement area on the forehead. The collected information is uploaded to the cloud in real time for COVID-19 prevention. Experiments show that the detection model is only 6.1M and the average detection time is 257 ms. At a distance of 1 m, the error of indoor temperature measurement is about 3%. Thus, the proposed system can realize real-time temperature measurement in public areas.
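The landmark mapping described above can be illustrated with a small sketch. The abstract does not state how the affine transformation is obtained, so this example assumes it is fitted by least squares from a few matched calibration points seen by both lenses. The function names, calibration points, and landmark coordinates are all hypothetical.

```python
import numpy as np

def fit_affine(src_pts, dst_pts) -> np.ndarray:
    """Least-squares 2x3 affine matrix [A | t] with dst ≈ A @ src + t."""
    src = np.asarray(src_pts, dtype=float)
    dst = np.asarray(dst_pts, dtype=float)
    src_h = np.hstack([src, np.ones((len(src), 1))])        # homogeneous (N, 3)
    params, *_ = np.linalg.lstsq(src_h, dst, rcond=None)    # (3, 2)
    return params.T                                          # (2, 3)

def apply_affine(matrix: np.ndarray, pts) -> np.ndarray:
    """Map (N, 2) points through a 2x3 affine matrix."""
    pts = np.asarray(pts, dtype=float)
    return pts @ matrix[:, :2].T + matrix[:, 2]

# Hypothetical calibration pairs: thermal = 0.5 * rgb + (10, 20).
rgb_calib = np.array([[0, 0], [100, 0], [0, 100], [100, 100]], dtype=float)
thermal_calib = 0.5 * rgb_calib + np.array([10.0, 20.0])

rgb_to_thermal = fit_affine(rgb_calib, thermal_calib)

# Hypothetical forehead landmark detected on the RGB image.
forehead_rgb = np.array([[40.0, 60.0]])
forehead_thermal = apply_affine(rgb_to_thermal, forehead_rgb)
print(forehead_thermal)
```

At least three non-collinear point pairs are needed to determine a 2D affine map. Extra pairs, as used here, let least squares average out calibration noise.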
CV Apr 19, 2020
Lightweight Mask R-CNN for Long-Range Wireless Power Transfer Systems
Hao Li, Aozhou Wu, Wen Fang et al.
Resonant Beam Charging (RBC) is a wireless charging technology which supports multi-watt power transfer over meter-level distance. The features of safety, mobility and simultaneous charging capability enable RBC to charge multiple mobile devices safely at the same time. To detect the devices that need to be charged, a Mask R-CNN based detection model was proposed in previous work. However, considering the constraints of the RBC system, it is not easy to apply Mask R-CNN in lightweight hardware-embedded devices because of its heavy model and huge computation. Thus, we propose a machine learning detection approach which provides a lighter and faster model based on traditional Mask R-CNN. The proposed approach makes the object detection model much easier to port to mobile devices and reduces the burden of hardware computation. By adjusting the structure of the backbone and the head part of Mask R-CNN, we reduce the average detection time from $1.02\mbox{s}$ per image to $0.6132\mbox{s}$, and reduce the model size from $245\mbox{MB}$ to $47.1\mbox{MB}$. The improved model is much more suitable for application in the RBC system.
SY Sep 25, 2018
Adaptive Resonant Beam Charging for Intelligent Wireless Power Transfer
Qingqing Zhang, Wen Fang, Mingliang Xiong et al.
As a long-range high-power wireless power transfer (WPT) technology, resonant beam charging (RBC) can transmit watt-level power over long distances for devices in the internet of things (IoT). Due to its open-loop architecture, RBC faces the challenge of providing dynamic current and voltage to optimize battery charging performance. In RBC, battery overcharge may cause energy waste, thermal effects, and even safety issues. On the other hand, battery undercharge may lead to extended charging time and significant battery capacity reduction. In this paper, we present an adaptive resonant beam charging (ARBC) system for battery charging optimization. Based on RBC, ARBC uses a feedback system to control the supplied power dynamically according to the battery's preferred charging values. Moreover, in order to transform the received current and voltage to match the battery's preferred charging values, ARBC adopts a direct current to direct current (DC-DC) conversion circuit. Relying on the analytical models for RBC power transmission, we obtain the end-to-end power transfer relationship of ARBC in an approximately linear closed form. Thus, the battery's preferred charging power at the receiver can be mapped to the supplied power at the transmitter for feedback control. Numerical evaluation demonstrates that ARBC can save 61% battery charging energy and 53%-60% supplied energy compared with RBC. Furthermore, ARBC has a high energy-saving gain over RBC when the WPT is inefficient. ARBC in WPT is similar to link adaptation in wireless communications. Both play important roles in their respective areas.
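The feedback step described above, mapping the battery's preferred charging power back to a supplied power through an approximately linear relationship, can be sketched as follows. The abstract gives no coefficients, so the slope, offset, and supply limit below are placeholder assumptions, and the function name is hypothetical. This illustrates inverting a linear model and is not the paper's closed-form expression.

```python
def supplied_power_w(preferred_w: float, slope: float, offset_w: float,
                     max_supply_w: float) -> float:
    """Invert an assumed linear model  received = slope * supplied + offset.

    All parameters are illustrative assumptions; the result is clamped
    to the range the transmitter can actually deliver.
    """
    if slope <= 0:
        raise ValueError("slope must be positive for the model to be invertible")
    needed = (preferred_w - offset_w) / slope
    return min(max(needed, 0.0), max_supply_w)

# Placeholder coefficients: received = 0.25 * supplied - 0.5 (watts).
print(supplied_power_w(2.0, slope=0.25, offset_w=-0.5, max_supply_w=100.0))
print(supplied_power_w(50.0, slope=0.25, offset_w=-0.5, max_supply_w=100.0))
```

The second call asks for more than the assumed transmitter limit allows, so the request is clamped to that limit.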
SY Sep 25, 2018
Fair Scheduling in Resonant Beam Charging for IoT Devices
Wen Fang, Qingqing Zhang, Qingwen Liu et al.
Resonant Beam Charging (RBC) is a Wireless Power Transfer (WPT) technology that can provide high-power, long-distance, mobile, and safe wireless charging for Internet of Things (IoT) devices. Supporting simultaneous charging of multiple IoT devices is a significant feature of the RBC system. To optimize the multi-user charging performance, the transmitting power should be scheduled for charging all IoT devices simultaneously. In order to keep all IoT devices working as long as possible for fairness, we propose the First Access First Charge (FAFC) scheduling algorithm. Then, we formulate the scheduling parameters quantitatively for algorithm implementation. Finally, we analyze the performance of the FAFC scheduling algorithm considering the impacts of the receiver number, the transmitting power and the charging time. Based on the analysis, we summarize the methods of improving the WPT performance for multiple IoT devices, which include limiting the receiver number, increasing the transmitting power, prolonging the charging time and improving the single-user charging efficiency. The FAFC scheduling algorithm design and analysis provide a fair WPT solution for the multi-user RBC system.
SY Sep 25, 2018
Earning Maximization with Quality of Charging Service Guarantee for IoT Devices
Wen Fang, Qingqing Zhang, Mingqing Liu et al.
Resonant Beam Charging (RBC) is a promising Wireless Power Transfer (WPT) technology to provide long-range, high-power, mobile and safe wireless power for Internet of Things (IoT) devices. The Point-to-Multipoint (PtMP) RBC system can charge multiple receivers simultaneously, similar to WiFi communications. To guarantee the Quality of Charging Service (QoCS) for each receiver and maximize the overall earning in the PtMP RBC service, we specify the Charging Pricing Strategy (CPS) and develop the High Priority Charge (HPC) scheduling algorithm to control the charging order and power allocation. Each receiver is assigned a priority, which is updated dynamically based on its State of Charging (SOC) and specified charging power. The receivers with high priorities are scheduled to be charged in each time slot. We present the pseudo code of the HPC algorithm based on quantifying the receiver's SOC, discharging energy and various relevant parameters. Relying on simulation analysis, we demonstrate that the HPC algorithm can achieve better QoCS and earning than the Round-Robin Charge (RRC) scheduling algorithm. Based on the performance evaluation, we illustrate that the methods to improve the PtMP RBC service are: 1) limiting the receiver number within a reasonable range and 2) prolonging the charging duration as long as possible. In summary, the HPC scheduling algorithm provides a practical strategy to maximize the earning of the PtMP RBC service while guaranteeing each receiver's QoCS.
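The per-slot selection of high-priority receivers can be illustrated with a generic priority scheduler. The abstract does not give the HPC priority formula or its pricing terms, so the rule below (lower SOC and higher specified power mean higher priority) is a hypothetical stand-in, as are the `Receiver` fields and the power budget. It is a sketch of priority-based slot scheduling in general, not the paper's HPC pseudo code.

```python
from dataclasses import dataclass

@dataclass
class Receiver:
    name: str
    soc: float        # state of charging, 0.0 (empty) to 1.0 (full)
    power_w: float    # specified charging power in watts

def priority(r: Receiver) -> float:
    # Hypothetical rule, NOT the paper's formula: emptier batteries and
    # larger specified charging power rank higher.
    return (1.0 - r.soc) * r.power_w

def schedule_slot(receivers: list[Receiver], budget_w: float) -> list[str]:
    """Pick receivers for one time slot, highest priority first,
    while their specified power still fits in the remaining budget."""
    chosen = []
    remaining = budget_w
    for r in sorted(receivers, key=priority, reverse=True):
        if r.soc < 1.0 and r.power_w <= remaining:
            chosen.append(r.name)
            remaining -= r.power_w
    return chosen

# Made-up receivers and a made-up 9 W transmit budget.
receivers = [
    Receiver("A", soc=0.2, power_w=5.0),
    Receiver("B", soc=0.9, power_w=5.0),
    Receiver("C", soc=0.5, power_w=4.0),
]
print(schedule_slot(receivers, budget_w=9.0))
```

In a full simulation the SOC values would be updated after every slot, so the priorities and the chosen set change over time, which matches the dynamic priority update the abstract mentions.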