Abstract
Assessment of minimally invasive surgical skills is a non-trivial task, usually requiring the presence and time of expert observers, including subjectivity and requiring special and expensive equipment and software. Although there are virtual simulators that provide self-assessment features, they are limited as the trainee loses the immediate feedback from realistic physical interaction. The physical training boxes, on the other hand, preserve the immediate physical feedback, but lack the automated self-assessment facilities. This study develops an algorithm for real-time tracking of laparoscopy instruments in the video cues of a standard physical laparoscopy training box with a single fisheye camera. The developed visual tracking algorithm recovers the 3D positions of the laparoscopic instrument tips, to which simple colored tapes (markers) are attached. With such system, the extracted instrument trajectories can be digitally processed, and automated self-assessment feedback can be provided. In this way, both the physical interaction feedback would be preserved and the need for the observance of an expert would be overcome. Real-time instrument tracking with a suitable assessment criterion would constitute a significant step towards provision of real-time (immediate) feedback to correct trainee actions and show them how the action should be performed. This study is a step towards achieving this with a low cost, automated, and widely applicable laparoscopy training and assessment system using a standard physical training box equipped with a fisheye camera.
Introduction
Laparoscopy is a minimal invasive surgery performed in the abdominal cavity with the most important advantage of fast recovery of patients, compared to conventional open surgery procedures. Using only small incisions, the surgeon can perform an operation such as removing parts on organs or retrieving tissue samples for further analysis, without fully opening the abdomen (). However, this method brings new challenges to the surgeon as it is more difficult to perform than a conventional open surgery. The main challenges are a reduced field of view due to the use of a single camera, loss of depth perception, less sensitive force perception, and inverted motions due to a rotation around the insertion point (fulcrum effect) (; ). Exemplary camera views of a suturing training from the inside of the training box used in this study are seen in Figure 1, as adapted from our previous work (). To adapt to those challenges a surgeon must carry out an intensive training, which is difficult to be objectively assessed due to the lack of consistent quantitative measures ().
FIGURE 1
Laparoscopy training aims at motor learning (
For training and assessment of laparoscopy skills there are physical box trainers (
A promising approach that has been appearing in the recent years is to equip physical training boxes with machine vision and intelligence to assess the physical performance of the trainee (
In this study, we apply our algorithm to the videos of training sessions for intra-corporeal wound suturing, which is considered to be one of the most difficult procedures in laparoscopy training (
Tracking methods for objects in known environments are well known in the literature and have already been used in several studies on laparoscopy, such as 2D tooltip location tracking in laparoscopy training videos for eye-hand coordination analysis (
The rest of the paper is organized as follows. Our implementation of the real-time tracking in 2D images using color-based markers is presented in Marker Corner Detection in 2D Images. In Tool Tip Position Tracking In 3D, we explain the method used for 3D Cartesian position estimation in real-time. In Testing And Verification, we compare the performance of trajectory extraction with respect to the ground truth trajectories generated by a Robotic Surgery Trainer setup incorporating two UR3 universal Robots. Conclusion concludes the paper.
Marker Corner Detection in 2D Images
Instrument detection is realized by tracking the colored tapes attached to the end of the two instruments as in Figure 1 and Figure 2. The colors of the tapes are chosen to be easily separable from the background (usually a pink colored suturing pad) and each other in a Hue Saturation Value (HSV) space. The tracking problem in this setting can be described as subject to a close-to-invariant light exposure (closed environment and short time recording). The image processing techniques used in this study are individually well known in literature; therefore, we will only mention them briefly without detailed explanations. We note that, what we have performed in this study is adapting these techniques and integrating them effectively to solve the specific detection challenges in the context of a laparoscopy training practice. For example, and specifically, the method we have developed allows detection of the corners of a markers even when some parts of the marker are separated from each other in the image, which can happen in two different cases: when the rope is wrapped around the instrument over the tracked marker as in Figure 1B, and when one of the instruments obscures part of the tracked marker on the other instrument as in Figure 1C. The real-time tracking process is realized using a two steps method, a detection step where the four corners of a colored tape on each instrument are found in the current 2D image frame (explained in the following sub-sections) and a tracking step where a 3D position of the instrument tips are generated using the detected corners (explained in TOOL TIP POSITION TRACKING IN 3D).
FIGURE 2

Practical workspace for both instruments based on the collected data.
Preprocessing
Using the recorded video retrieved from our experiments, an HSV color database is constructed for the range of the pink background pad and for the range of the colors of the tapes on the instruments, across 20 videos recorded from six professional and 10 novice subjects. The HSV range for the tapes is identified by isolating 50 × 50 square regions containing each tape. The mean HSV values are then extracted and used to create the database to be compared to new inputs (
The second part of the preprocessing is a full frame detection. Using color space conversion (cvtColor in OpenCV) on the full image is time consuming; therefore, it is performed only on the first two video frames. The marker positions in the image are retrieved and the center of gravity of each detected contour around the marker is used to estimate the position of the contour in the next frame using the motion gradient. Finally, the extrinsic parameters of the camera are retrieved using the perspective transformation matrix based on the pink pad background corners in the very first frame. Then the Euler angles representing the camera orientation are extracted, as the camera height being adjustable and hence might change across the use of the system in different times. In our setup this initialization is applied automatically every time the system is turned on.
Our detection process follows a general framework for HSV object detection (
FIGURE 3

Sample ROI squares around the markers.
In order to apply a region of interest with the dynamic size detection process, some specific cases must be defined and handled properly to avoid wrong detection and thus losing the instruments. The specific cases are identified as follows:
• The area is not consistent with the previous detection.
• The velocity of the instrument is not realistic.
• The detection algorithm could not find a set of four corners in the previous ROI.
If any of these special cases is detected, the next position estimation is rendered incorrect and the ROI cannot be computed, thus a search of the instrument is applied on a larger part of the image. This detection also is not made on the full image. A notion of dead space is identified based on the recorded dataset which leads to identification of a “practical workspace” for the instruments as in Figure 2 and the search is conducted only in this workspace.
One of the challenges with two instrument detection is the crossing event when one of the instruments is obscured or partially obscured by the other (Figure 1C, Figure 4). To deal with such crossing, the separated parts of the overall contour of the obscured marker are detected using Canny detection and then a convex hull is created using the detected parts. This method allows a regrouping of the separated parts to deal with the situation illustrated in Stage 3 of the crossing (Figure 4B). Furthermore, it allows to simplify the representation of the geometry of the contour and thus speeds up the process of finding the corners using Hough transform.
FIGURE 4

(A) Distance between instruments; (B) crossing between instruments.
Another advantage of using a convex hull representation is the following. In order to speed up the detection process, we are directly using the raw (non-flattened) fisheye camera output, thus the shape retrieved before application of the convex hull does not have straight lines. This would result in having a large set of candidate points for corner selection after the Hough transform. This is avoided and the number of the candidate points for the corners is narrowed down by adapting a convex hull. Using the raw feed also results that we cannot directly identify the correct set of four points in the Hough transform output. To overcome this, we flatten the output points from the Hough transform. In this way, we apply flattening only to about 40 points at the end of the detection, instead of approximately 2 million initially in the frame. For flattening, we use the intrinsic parameters of the camera (distortion, focal lengths and focal points) retrieved in advance from a chessboard calibration.
For the corner selection we use the knowledge on the contour pose in the image and the properties of the trapezoidal shape when the cylinders (markers) are viewed from top (Figure 5) where the detected line segments C and B in Figure 6A must remain parallel. This method allows to find the best candidates for the corner points from the list output in the previous steps. The full detection process with image processing can be summarized with the block diagram in Figure 7.
FIGURE 5

Representation of the trapezoidal shapes after flattening the view of the cylinders and the angle of the instruments regarding the camera.
FIGURE 6

(A) Estimated α angle, rotation around Z of the instrument around the Center of Rotation and parameters of the trapezoidal detection; (B) parameters of the real instruments; (C) estimated β angle, rotation around X.
FIGURE 7

Diagram of the detection process with image processing.
Estimation for Missing Corners
In the previously mentioned specific cases, a correct corner detection is not possible with the image processing as explained up to this point. In these cases, a Kalman filter is used to estimate the position of the instruments. The Kalman filter implementation is a standard one where the estimation of the next position of the instrument is based on a corrector and predictor equation (
The second situation that requires the Kalman filter estimation is when one of the instruments goes out of the video frame which can simply be estimated and detected using the minimal distance between the previously detected location of the instrument and the borders of the image frame. For such cases, the Kalman filter estimation is directly used. Finally, the four corners of both instruments estimated by the Kalman Filter are flattened similarly as in the previous sub-section.
Overall Procedure of Detection of Marker Corners
The following are the overall steps of procedure applied for detection of the marker corners in the 2D images as explained in this section:
1. Check whether any of the “specific cases” applies (such as the detection could not find four corners in the previous ROI); if so, perform Pre-processing in the practical workspace (Figure 2); otherwise continue with Step 2.
2. Identify the new ROI (Eq. 1).
3. Apply HSV decomposition in the ROI.
4. Apply Canny edge detection
5. Apply Convex Hull regrouping to obtain a connected contour representing the marker.
6. Apply Hough transform for line detection.
7. Identify the candidate corners and apply flattening at these corner points.
8. Identify the four corners of the marker using the information of geometric relations.
9. Check if any marker is occluded (Equation 2 and Eq. 3) or out of view; if, yes, use Kalman Filter output to estimate the location of the occluded corners and apply flattening at the estimated corners; otherwise stay with the identified corners in Step 9.
10. Output the corner coordinates for depth estimation.
Tool Tip Position Tracking in 3D
In order to track the full 3D position of the tip of the instruments, we first reconstruct the 2D position information from the 2D Camera view following the methods explained in Marker Corner Detection In 2D Images. Afterwards, we estimate the depth using the difference between the computed circumference of the detected marker polygon as seen in the image and the actual circumference of the polygon.
Real-Time 3D Tracking
As seen in Figure 2, there are two instrument tips, each with four degrees of freedom actively controlled by the subject. However, in this study we track only the three degrees of freedom, the translational movements of each instrument, and ignore the rotational movement around the shaft axis. This is because, almost all criteria of performance that apply to instrument movements in laparoscopy training (
The three degrees of freedom translational movement of the tip point can be represented by (or translated into) other movement parameters, possibly some of them defined as rotations around specific axes, such as rotation of the instrument shaft around an axis through the insertion point. In this study, we consider successive elementary transformations with respect to the “current reference frame” constructed after each transformation (
FIGURE 8

Reference frames at the center of rotation and tool tip (
Tracking the tip point of an instrument corresponds to identifying the , , and parameters in the above transformation matrix, which can then be used to find the Cartesian position of the tip of the instrument with respect to the global reference frame R0. Our approach is first to identify the and angles, using the computed depth difference between the front ( edge in Figure 6A) and rear ( edge in Figure 6A) segments of the marker along with the geometric relations as shown in Figure 6. The depth estimation, , is based on the ratio of the perimeter of the detected marker in the 2D image to the actual perimeter of the marker as in Equation (6),where represents the actual perimeter of the marker and represents the computed perimeter from the image. First, we consider the front segment, the edge in Figure 6A and use this as the image correspondence of the actual length of the marker. We already know the ratio between and segments of the actual ma rker in Figure 6B and using this ratio we can compute a length as the image correspondence of the actual edge, . Using these, the perimeter of the rectangle in the image at the location of can be computed as . We can then compute the depth of the rectangle parallel to the ground as located at (Figure 6C). Following the same procedure, we can also compute the depth of the rectangle parallel to the ground as located at . The difference between these two depth values provide us with the depth difference due to inclination, value, indicated in Figure 6C. Knowing the actual width, and the depth difference we can compute the angle. For , the angle between the line connecting the centers of and edges and the y axis of the global reference frame is computed (Figure 6A). Once is known, the actual length of the instrument can be computed considering the visible length in the image and the inclination angle , along with the ratio between the actual length and the visible length in the image when the instrument is straightly aligned parallel to the ground (perpendicular to the camera view).
Testing for Real-Time Processing
In this section we present our analysis of the speed of processing of the overall algorithm in terms of frames-per-second (FPS), with respect to the compression rate we use in streaming the video to the computer and considering the success rate of detection of the corner points of the marker at an instrument-tip.
In order to achieve a fast processing, we use a streaming communication (TCP/IP) between the camera and the computer housing the image processing software. For that purpose, we apply a compression process on the video feed (on the slave side) to ensure fast and smooth streaming prior to tracking (on the master side). The rate of compression for the streaming is a major factor that impacts the overall speed and performance of detection. We use a JPEG compression and Figure 9 presents the results depicting the speed and performance of detection with varying compression rate. As it is observed in this figure, below 60% compression, the speed of processing increases whereas the performance for correct detection decreases monotonically. In this graph, 15% compression seems to be an optimal choice to achieve a sufficiently fast speed (above 50 Hz) and a high rate of correct detection (very close to 100%); therefore, we applied 15% compression throughout the tests presented in the following section.
FIGURE 9

The average speed of image processing to detect the corners of a marker on a single instrument in terms of frames-per-second (upper figure) and the rate of correct detection (lower figure) with respect to varying compression rate of video frames transmitted from the camera to the computer.
Testing and Verification
In our previous study (
FIGURE 10

Robotic Surgery Trainer setup used in this study to test the position accuracy of the real-time tracking algorithm.
In order to compare the instrument tip trajectory recorded by the robot to that estimated by the real-time tracking algorithm, we first transform the trajectory retrieved from the video tracking into the robot base frame. We then synchronize the two datasets as the robot recording frequency is 125 Hz, giving us a larger number of points in the robot trajectory compared to the tracked trajectory on the video. We eliminate the Euclidian distance between the numeric values of robot recorded and tracked trajectories considering the initial and final points of the trajectories, in order to align them as closely as possible. We then apply a zero-padding in frequency domain to equate the sample size of the position data in the two trajectories. Finally, we apply a norm distance measure between the data of every corresponding couple in the two trajectory data sets to find out the maximal distance between the two trajectories.
For this measurement, we again considered the boxes we had used in the previous study (
FIGURE 11

(A) Two sample boxes used to generate the trajectories. (B) The trajectories generated by the robot (red) and tracked by the real-time image processing algorithm (blue) (units: m).
The maximum error between the estimated trajectories compared to the robot recorded trajectories through all the experiments was computed to be 1.5 mm in x, 4 mm in y, and 3 mm in z coordinates along the edges of the boxes as in Figure 11B. This performance is sufficient for our purposes to assess skill level with typical criteria as we applied in (
Conclusion
In this paper a real time 3D instrument trajectory tracking is developed for single camera laparoscopy training boxes. Trajectories extracted in real-time would be useful to perform real-time skill assessment and to provide real-time feedback, immediately as the subject performs unskilled motions. The work here is a first step towards achieving that goal, as it provides the facility for real-time trajectory extraction. The next step to build on this work would be to develop the assessment criteria that would function in real-time and that would be in such a characteristic to provide immediate feedback to the trainee. The criteria that would serve that purpose are yet to be developed and tested. In our previous work (
FIGURE 12

Sample laparoscopy instrument tip-point trajectories (units: cm) as successfully discriminated to belong (A) to a professional and (B) to a novice by our novel assessment criterion based on Linear Discriminant Analysis (LDA) presented in (
Statements
Data availability statement
The datasets presented in this article are not readily available because the topic of the paper is about image processing to track objects in the videos collected and the content of the videos themselves are not necessary for the topic presented. Requests to access the datasets should be directed to Dr. Mustafa Suphi Erden (m.s.erden@hw.ac.uk).
Ethics statement
We have recorded videos from six professional surgeons and ten novice subjects. Ethical approval was acquired from the Ethics Committee of School of Engineering and Physical Sciences at Heriot-Watt University with Ethics Approval number 18/EA/MSE/1 and all participants provided their Informed Consent prior to data collection.
Author contributions
BG has conducted the research icluding data collection from subjects, theoretical development, iplementation, verification, and writing the paper; HT has taken part in preparation of the experimental platfrom, data collection, and editing the paper; BT has contributed to data collection in laparoscopy training centre from surgeons, specifically prepared the experimental environment and recruited surgeons to take part in the experiments; GN has supervised the experimental envirenment and preparation in the laparoscopy training centre; MSE has supervised the overall research, took part in writing and editing the paper, sustained the funding, planned and coordinated the overall research, contact with the surgeons, and experimentation.
Funding
This research was partially funded by EPSRC under the Grant Reference EP/P013872/1.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors, and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/frobt.2021.751741/full#supplementary-material
References
1
AbterS. O.AbdullahN. A. Z. (2017). Colored Image Retrieval Based on Most Used Colors. Int. J. Adv. Comp. Sci. Appl.8 (9), 264–269. 10.14569/IJACSA.2017.080938
2
AggarwalR.MoorthyK.DarziA. (2004). Laparoscopic Skills Training and Assessment. Br. J. Surg.91 (12), 1549–1558. 10.1002/bjs.4816
3
AhlborgL.HedmanL.NisellH.Felländer-TsaiL.EnochssonL.Felländer-TsaiL. (2013). Simulator Training and Non-technical Factors Improve Laparoscopic Performance Among OBGYN Trainees. Acta Obstet. Gynecol. Scand.92 (10), 1194–1201. 10.1111/aogs.12218
4
AhmmadS. N. Z.MingE. S. L.FaiY. C.bin Che HarunF. K., (2011). Assessment Methods for Surgical Skill. World Acad. Sci. Eng. Tech. Int. J. Biomed. Biol. Eng.5 (10), 504–510. 10.5281/zenodo.1058059
5
AllenB. F.KasperF.NataneliG.DutsonE.FaloutsosP. (2011). Visual Tracking of Laparoscopic Instruments in Standard Training Environments. Stud. Health Technol. Inform.163 (18), 11–17. 10.3233/978-1-60750-706-2-11
6
Alonso-SilverioG. A.Pérez-EscamirosaF.Bruno-SanchezR.Ortiz-SimonJ. L.Muñoz-GuerreroR.Minor-MartinezA.et al (2018). Development of a Laparoscopic Box Trainer Based on Open Source Hardware and Artificial Intelligence for Objective Assessment of Surgical Psychomotor Skills. Surg. Innov.25 (4), 380–388. 10.1177/1553350618777045
7
BergD. A.MilnerR. E.FisherC. A.GoldbergA. J.DempseyD. T.GrewalH. (2007). A Cost-Effective Approach to Establishing a Surgical Skills Laboratory. Surgery142, 712–721. 10.1016/j.surg.2007.05.011
8
ChangO. H.KingL. P.ModestA. M.HurH.-C. (2016). Developing an Objective Structured Assessment of Technical Skills for Laparoscopic Suturing and Intracorporeal Knot Tying. J. Surg. Edu.73 (2), 258–263. 10.1016/j.jsurg.2015.10.006
9
ChenS. Y. (2012). Kalman Filter for Robot Vision: A Survey. IEEE Trans. Ind. Electron.59 (11), 4409–4420. 10.1109/tie.2011.2162714
10
ChmarraM. K.GrimbergenC. A.DankelmanJ. (2007). Systems for Tracking Minimally Invasive Surgical Instruments. Minimally Invasive Ther. Allied Tech.16 (6), 328–340. 10.1080/13645700701702135
11
ChmarraM. K.JansenF. W.GrimbergenC. A.DankelmanJ. (2008). Retracting and Seeking Movements during Laparoscopic Goal-Oriented Movements. Is the Shortest Path Length Optimal?Surg. Endosc.22, 943–949. 10.1007/s00464-007-9526-z
12
CraigJ., (2005). Introduction to Robotics – Mechanics and Control. 3rd Edition. New Jersey, US, Pearson Prentice Hall, 19–61.
13
CucchiaraR.GranaC.PiccardiM.PratiA.SirottiS., (2001). “Improving Shadow Suppression in Moving Object Detection with HSV Color Information”, Proc. of IEEE Intelligent Transportation Systems Conference (ITSC'01), Oakland, CA, USA, Aug 25-Aug 29 2001, pp. 334–339. 10.1109/ITSC.2001.948679
14
DankelmanJ.ChmarraM. K.VerdaasdonkE. G.StassenL. P.GrimbergenC. A. (2005). Fundamental Aspects of Learning Minimally Invasive Surgical Skills. Minim. Invasive Ther. Allied Technol.14 (4-5), 247–256. 10.1080/13645700500272413
15
DattaV.MackayS.MandaliaM.DarziA. (2001). The Use of Electromagnetic Motion Tracking Analysis to Objectively Measure Open Surgical Skill in the Laboratory-Based Model 1 1No Competing Interests Declared. J. Am. Coll. Surgeons193 (5), 479–485. 10.1016/s1072-7515(01)01041-9
16
De PaolisL. T.RicciardiF.GiulianiF. (2014). “Development of a Serious Game for Laparoscopic Suture Training,” in Augmented and Virtual Reality. Editors De PaolisL. T.MongelliA. (Cham: Springer International Publishing), 90–102. 10.1007/978-3-319-13969-2_7
17
DoignonC.NageotteF.de MathelinM. (2006). “The Role of Insertion Points in the Detection and Positioning of Instruments in Laparoscopy for Robotic Tasks,” in MICCAI 2006, LNCS. Editors LarsenR.NielsenM.SporringJ. (Berlin: Springer), Vol. 4190, 527–534. 10.1007/11866565_65
18
EstradaS.DuranC.SchulzD.BismuthJ.ByrneM. D.O'MalleyM. K. (2016). Smoothness of Surgical Tool Tip Motion Correlates to Skill in Endovascular Tasks. IEEE Trans. Human-mach. Syst.46 (5), 647–659. 10.1109/thms.2016.2545247
19
EyvazzadehD.KavicS. M. (2011). Defining "Laparoscopy" through Review of Technical Details in JSLS. JSLS15, 151–153. 10.4293/108680811x13022985131895
20
GautierB.TugalH.TangB.NabiG.ErdenM. S. (2019). Laparoscopy Instrument Tracking for Single View Camera and Skill Assessment, International Conference on Robotics and Automation (ICRA), Montreal, Canada, May 20-May 24 2019, Palais des congres de Montreal.
21
GrecoE. F.RegehrG.OkrainecA. (2010). Identifying and Classifying Problem Areas in Laparoscopic Skills Acquisition: Can Simulators Help?Acad. Med.85 (10), S5–S8. 10.1097/acm.0b013e3181ed4107
22
GuttC. N.KimZ.-G.KrähenbühlL. (2002). Training for Advanced Laparoscopic Surgery. Eur. J. Surg.168, 172–177. 10.1080/110241502320127793
23
HalsbandU.LangeR. K. (2006). Motor Learning in Man: a Review of Functional and Clinical Studies. J. Physiology-Paris99, 414–424. 10.1016/j.jphysparis.2006.03.007
24
HamudaE.Mc GinleyB.GlavinM.JonesE. (2017). Automatic Crop Detection under Field Conditions Using the HSV Colour Space and Morphological Operations. Comput. Electro. Agric.133, 97–107. 10.1016/j.compag.2016.11.021
25
HudgensJ. L.PasicR. P. (2015). Fundamentals of Geometric Laparoscopy and Suturing. Germany: Endo-Press GmbH.
26
JiangX.ZhengB.AtkinsM. S. (2014). Video Processing to Locate the Tooltip Position in Surgical Eye-Hand Coordination Tasks. Surg. Innov.22 (3), 285–293. 10.1177/1553350614541859
27
JudkinsT. N.OleynikovD.NarazakiK.StergiouN. (2006). Robotic Surgery and Training: Electromyographic Correlates of Robotic Laparoscopic Training. Surg. Endosc.20 (5), 824–829. 10.1007/s00464-005-0334-z
28
KroezeS. G. C.MayerE. K.ChopraS.AggarwalR.DarziA.PatelA. (2009). Assessment of Laparoscopic Suturing Skills of Urology Residents: A Pan-European Study. Eur. Urol.56 (5), 865–873. 10.1016/j.eururo.2008.09.045
29
KunertW.StorzP.DietzN.AxtS.FalchC.KirschniakA.et al (2020). Learning Curves, Potential and Speed in Training of Laparoscopic Skills: a Randomised Comparative Study in a Box Trainer. Surg. Endosc.35, 3303–3312. 10.1007/s00464-020-07768-1
30
LahanasV.LoukasC.SmailisN.GeorgiouE. (2015). A Novel Augmented Reality Simulator for Skills Assessment in Minimal Invasive Surgery. Surg. Endosc.29, 2224–2234. 10.1007/s00464-014-3930-y
31
LevyB.MobasheriM. (2017). Principles of Safe Laparoscopic Surgery. Surgery (Oxford)35 (4), 216–219. 10.1016/j.mpsur.2017.01.010
32
LinB.SunY.QianX.GoldgofD.GitlinR.YouY. (2016). Video‐based 3D Reconstruction, Laparoscope Localization and Deformation Recovery for Abdominal Minimally Invasive Surgery: a Survey. Int. J. Med. Robotics Comput. Assist. Surg.12, 158–178. 10.1002/rcs.1661
33
MegaliG.SinigagliaS.TonetO.DarioP. (2006). Modelling and Evaluation of Surgical Performance Using Hidden Markov Models. IEEE Trans. Biomed. Eng.53 (10), 1911–1919. 10.1109/tbme.2006.881784
34
MoorthyK.MunzY.DosisA.BelloF.DarziA. (2003). Motion Analysis in the Training and Assessment of Minimally Invasive Surgery. Minim. Invasive Ther. Allied Technol.12 (3-4), 137–142. 10.1080/13645700310011233
35
NguyenH. T.SmeuldersA. W. M. (2004). Fast Occluded Object Tracking by a Robust Appearance Filter. IEEE Trans. Pattern Anal. Machine Intell.26 (8), 1099–1104. 10.1109/tpami.2004.45
36
Pérez-EscamirosaF.OropesaI.Sánchez-GonzálezP.Tapia-JuradoJ.Ruiz-LizarragaJ.Minor-MartínezA. (2018). Orthogonal Cameras System for Tracking of Laparoscopic Instruments in Training Environments. Cirugia y Cirujanos86, 548–555. 10.24875/CIRU.18000348
37
RetrosiG.CundyT.HaddadM.ClarkeS. (2015). Motion Analysis-Based Skills Training and Assessment in Pediatric Laparoscopy: Construct, Concurrent, and Content Validity for the eoSim Simulator. J. Laparoendoscopic Adv. Surg. Tech.25 (11), 944–950. 10.1089/lap.2015.0069
38
RiazM.PankooK.JonganP., (2009). “Extracting Color Using Adaptive Segmentation for Image Retrieval”, Proc. of 2nd International Joint Conference on Computational Sciences and Optimization (CSO), Sanya, China, April 24-April 26 2009. 10.1109/cso.2009.290
39
Sánchez-MargalloJ. A.Sánchez-MargalloF. M.PagadorJ. B.GómezE. J.Sánchez-GonzálezP.UsónJ.et al (2011). Video-based Assistance System for Training in Minimally Invasive Surgery. Minimally Invasive Ther. Allied Tech.20 (4), 197–205. 10.3109/13645706.2010.534243
40
SchreuderH.van den BergC.HazebroekE.VerheijenR.SchijvenM. (2011). Laparoscopic Skills Training Using Inexpensive Box Trainers: Which Exercises to Choose when Constructing a Validated Training Course. BJOG-An Int. J. af Obstet. Gynaecol.118, 1576–1584. 10.1111/j.1471-0528.2011.03146.x
41
StrandbygaardJ.BjerrumF.MaagaardM.WinkelP.LarsenC. R.RingstedC.et al (2013). Instructor Feedback versus No Instructor Feedback on Performance in a Laparoscopic Virtual Reality Simulator. Ann. Surg.257 (5), 839–844. 10.1097/sla.0b013e31827eee6e
42
UlrichA. P.ChoM. Y.LamC.LernerV. T. (2020). A Low-Cost Platform for Laparoscopic Simulation Training. Obstet. Gynecol.136 (1), 77–82. 10.1097/aog.0000000000003920
43
VassiliouM. C.FeldmanL. S.AndrewC. G.BergmanS.LeffondréK.StanbridgeD.et al (2005). A Global Assessment Tool for Evaluation of Intraoperative Laparoscopic Skills. Am. J. Surg.190 (1), 107–113. 10.1016/j.amjsurg.2005.04.004
44
von WebskyM. W.RaptisD. A.VitzM.RosenthalR.ClavienP. A.HahnloserD. (2013). Access to a Simulator Is Not Enough: The Benefits of Virtual Reality Training Based on Peer-Group-Derived Benchmarks-A Randomized Controlled Trial. World J. Surg.37 (11), 2534–2541. 10.1007/s00268-013-2175-6
45
VorosS.OrvainE.CinquinP.LongJ. A. (2006). “Automatic Detection of Instruments in Laparoscopic Images: a First Step towards High Level Command of Robotized Endoscopic Holders,” in IEEE/RAS-EMBS International Conference on Biomedical Robotics and Biomechatronics (BioRob), Pisa, Italy, Feb 20-Feb 22 2006. 10.1109/BIOROB.2006.1639240
46
WangH.WangS.DingJ.LuoH. (2010). Suturing and Tying Knots Assisted by a Surgical Robot System in Laryngeal Mis. Robotica28 (2), 241–252. 10.1017/s0263574709990622
47
XinH.ZelekJ. S.CarnahanH., “Laparoscopic Surgery, Perceptual Limitations and Force : A Review,” First Canadian Student Conference on Biomedical Computing, Kingston, ON, 2006.
48
ZhangF.WangZ.ChenW.HeK.WangY.LiuY. H. (2020). Hand-Eye Calibration of Surgical Instrument for Robotic Surgery Using Interactive Manipulation. IEEE Robotics Automation Lett.5 (2), 1540–1547. 10.1109/LRA.2020.2967685
49
ZhangX.PayandehS. (2002). Application of Visual Tracking for Robot-Assisted Laparoscopic Surgery. J. Robotic Syst.19, 315–328. 10.1002/rob.10043
Summary
Keywords
real-time motion tracking, cartesian position estimation, single view camera, skill metric, laparacospy, laparoscopy training
Citation
Gautier B, Tugal H, Tang B, Nabi G and Erden MS (2021) Real-Time 3D Tracking of Laparoscopy Training Instruments for Assessment and Feedback. Front. Robot. AI 8:751741. doi: 10.3389/frobt.2021.751741
Received
01 August 2021
Accepted
13 October 2021
Published
04 November 2021
Volume
8 - 2021
Edited by
Troy McDaniel, Arizona State University, United States
Updates

Check for updates
Copyright
© 2021 Gautier, Tugal, Tang, Nabi and Erden.
This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Mustafa Suphi Erden, m.s.erden@hw.ac.uk
This article was submitted to Biomedical Robotics, a section of the journal Frontiers in Robotics and AI
Disclaimer
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.