LGSep 22, 2022
Turning Normalizing Flows into Monge Maps with Geodesic Gaussian Preserving FlowsGuillaume Morel, Lucas Drumetz, Simon Benaïchouche et al.
Normalizing Flows (NF) are powerful likelihood-based generative models that are able to trade off between expressivity and tractability to model complex densities. A now well established research avenue leverages optimal transport (OT) and looks for Monge maps, i.e. models with minimal effort between the source and target distributions. This paper introduces a method based on Brenier's polar factorization theorem to transform any trained NF into a more OT-efficient version without changing the final density. We do so by learning a rearrangement of the source (Gaussian) distribution that minimizes the OT cost between the source and the final density. We further constrain the path leading to the estimated Monge map to lie on a geodesic in the space of volume-preserving diffeomorphisms thanks to Euler's equations. The proposed method leads to smooth flows with reduced OT cost for several existing models without affecting the model performance.
ROOct 27, 2025
Localising under the drape: proprioception in the era of distributed surgical robotic systemMartin Huber, Nicola A. Cavalcanti, Ayoob Davoodi et al.
Despite their mechanical sophistication, surgical robots remain blind to their surroundings. This lack of spatial awareness causes collisions, system recoveries, and workflow disruptions, issues that will intensify with the introduction of distributed robots with independent interacting arms. Existing tracking systems rely on bulky infrared cameras and reflective markers, providing only limited views of the surgical scene and adding hardware burden in crowded operating rooms. We present a marker-free proprioception method that enables precise localisation of surgical robots under their sterile draping despite associated obstruction of visual cues. Our method solely relies on lightweight stereo-RGB cameras and novel transformer-based deep learning models. It builds on the largest multi-centre spatial robotic surgery dataset to date (1.4M self-annotated images from human cadaveric and preclinical in vivo studies). By tracking the entire robot and surgical scene, rather than individual markers, our approach provides a holistic view robust to occlusions, supporting surgical scene understanding and context-aware control. We demonstrate an example of potential clinical benefits during in vivo breathing compensation with access to tissue dynamics, unobservable under state of the art tracking, and accurately locate in multi-robot systems for future intelligent interaction. In addition, and compared with existing systems, our method eliminates markers and improves tracking visibility by 25%. To our knowledge, this is the first demonstration of marker-free proprioception for fully draped surgical robots, reducing setup complexity, enhancing safety, and paving the way toward modular and autonomous robotic surgery.
ROApr 13, 2018
Mechanical design of a distal scanner for confocal microlaparoscope: A conic solutionMustafa Suphi Erden, Benoît Rosa, Jérome Szewczyk et al.
This paper presents the mechanical design of a distal scanner to perform a spiral scan for mosaic-imaging with a confocal microlaparoscope. First, it is demonstrated with ex vivo experiments that a spiral scan performs better than a raster scan on soft tissue. Then a mechanical design is developed in order to perform the spiral scan. The design in this paper is based on a conic structure with a particular curved surface. The mechanism is simple to implement and to drive; therefore, it is a low-cost solution. A 5:1 scale prototype is implemented by rapid prototyping and the requirements are validated by experiments. The experiments include manual and motor drive of the system. The manual drive demonstrates the resulting spiral motion by drawing the tip trajectory with an attached pencil. The motor drive demonstrates the speed control of the system with an analysis of video thread capturing the trajectory of a laser beam emitted from the tip.