Recent advancements in surgical computer vision applications have been driven by fully-supervised methods, primarily using only visual data. These methods rely on manually annotated surgical videos to predict a fixed set of object categories, limiting their generalizability to unseen surgical procedures and downstream tasks. In this work, we put forward the idea that the surgical video lectures available through open surgical e-learning platforms can provide effective supervisory signals for multi-modal representation learning without relying on manual annotations. We address the surgery-specific linguistic challenges present in surgical video lectures by employing multiple complementary automatic speech recognition systems to generate text...
International audienceSurgical process analysis and modeling is a recent and important topic aiming ...
Recently a number of studies demonstrated impressive performance on diverse vision-language multimod...
In medical imaging, manual annotations can be expensive to acquire and sometimes infeasible to acces...
Recent advancements in surgical computer vision applications have been driven by fully-supervised me...
In the medical field, due to their economic and clinical benefits, there is a growing interest in mi...
Self-supervised learning has witnessed great progress in vision and NLP; recently, it also attracted...
PURPOSE: Automatic surgical instruction generation is a crucial part for intra-operative surgical as...
Out of all existing frameworks for surgical workflow analysis in endoscopic videos, action triplet r...
Abstract—Previous field studies show that surgery residents and medical students have difficulty rec...
International audiencePurpose: Automatic recognition of surgical activities from intraoperative surg...
Abstract. In recent years, surgical simulation has emerged at the forefront of new technologies for ...
International audienceNowadays, many surgeries, including eye surgeries, are video-monitored. We pre...
International audienceSurgical process analysis and modeling is a recent and important topic aiming ...
Recently a number of studies demonstrated impressive performance on diverse vision-language multimod...
In medical imaging, manual annotations can be expensive to acquire and sometimes infeasible to acces...
Recent advancements in surgical computer vision applications have been driven by fully-supervised me...
In the medical field, due to their economic and clinical benefits, there is a growing interest in mi...
Self-supervised learning has witnessed great progress in vision and NLP; recently, it also attracted...
PURPOSE: Automatic surgical instruction generation is a crucial part for intra-operative surgical as...
Out of all existing frameworks for surgical workflow analysis in endoscopic videos, action triplet r...
Abstract—Previous field studies show that surgery residents and medical students have difficulty rec...
International audiencePurpose: Automatic recognition of surgical activities from intraoperative surg...
Abstract. In recent years, surgical simulation has emerged at the forefront of new technologies for ...
International audienceNowadays, many surgeries, including eye surgeries, are video-monitored. We pre...
International audienceSurgical process analysis and modeling is a recent and important topic aiming ...
Recently a number of studies demonstrated impressive performance on diverse vision-language multimod...
In medical imaging, manual annotations can be expensive to acquire and sometimes infeasible to acces...