- 1Robotics and Mechanisms Laboratory, Department of Mechanical and Aerospace Engineering, University of California, Los Angeles, CA, United States
- 2The Laboratory for Intelligent Decision and Autonomous Robots, George W. Woodruf School of Mechanical Engineering, Georgia Institute of Technology, Atlanta, GA, United States
This paper presents a comparative study of data-driven acceleration techniques for mixed-integer bilinear programs (MIBLPs) applied to robot motion planning. MIBLPs combine discrete decision variables and nonlinear constraints, making them computationally challenging for real-time robotics applications. We investigate two reformulation strategies: (1) converting binary variables into continuous variables with complementarity constraints (MPCC), and (2) converting bilinear constraints into mixed-integer linear constraints using McCormick envelopes (MICP). Using offline computed solutions as datasets, we apply K-nearest neighbor methods to warm-start both reformulations. We experimented with the proposed data-driven MIBLP formulation for motion planning on a linear inverted pendulum with contacts, and planning motion using a single rigid body model with mode transitions and contacts. Our results demonstrate that when sufficient data is available, MICP achieves consistently fast solving speeds that are suitable for real-time computation, while MPCC achieves higher success rates with limited amount of data. Our approach is capable of planning motions for the SCALER robot platform to transition between bipedal and quadrupedal configurations to navigate around obstacles without pre-specified gaits. Code for reproducing our results is available at https://github.com/XuanLin/MIBLP_benchmark.
1 Introduction
Optimization-based methods are powerful tools for solving robotic motion planning problems. For problems involving discrete decisions and nonlinear constraints, typical approaches include mixed-integer convex programs (MICPs) (Deits and Tedrake, 2014; Lin et al., 2019), which explicitly handle discrete variables using techniques like Branch and Bound (Boyd and Mattingley, 2007) or Benders Decomposition (Geoffrion, 1972; Lin, 2024), and nonlinear programs (NLPs) (Dai et al., 2014; Winkler et al., 2018; Shirai et al., 2020), which deal with nonconvex constraints. Each approach has distinct advantages and limitations when solved with off-the-shelf solvers. MICP solvers can find global optimal solutions without any initial guess but often require prohibitively long solving times for problems with numerous integer variables. NLPs, while computationally more efficient when solved with e.g., interior point methods (Wachter, 2002), tend to converge to local optimal solutions that may exhibit inconsistent behavior sensitive to initial guesses. To introduce discrete variables, NLPs require complementary constraints (Yunt and Glocker, 2006; Posa et al., 2014), which is numerically challenging as it violates the majority of Constraint Qualifications established for standard nonlinear optimization (Luo et al., 1996). Combining MICPs and NLPs leads to mixed-integer nonlinear programs (MINLPs), such as mixed-integer bilinear programs (MIBLPs), which offer greater descriptive power for formulating complex problems but become computationally intractable unless the problem size is very small (Marcucci, 2024). This limits the practical implementation of MINLPs in online applications.
Researchers have been investigating machine learning methods to accelerate optimization solving procedures. Data can be gathered to enhance both general-purpose solvers and problem-specific approaches. For example, MIP solvers such as Gurobi (Gurobi Optimization, LLC, 2024) use branch-and-bound algorithms that heavily rely on cutting planes and heuristics to discover better solutions and eliminate infeasible ones. These techniques used inside solvers can be aided by machine learning methods (Nair et al., 2020; Tang et al., 2020). While these approaches aim to improve general solver performance, they require modifying solver internals and typically demand large-scale training datasets spanning many problem classes. Additionally, recent work has also explored neural network-based warm-starting for trajectory optimization in robotics, such as using diffusion models (Carvalho et al., 2023; Li et al., 2024), constraint-informed learning (Briden et al., 2025), and applications including navigating challenging obstacles in unfamiliar environments (Dalal et al., 2024) and trajectory generation on the International Space Station (Banerjee et al., 2025).
On the other hand, for parametric optimization problems where parameters follow specific distributions, solved problem instances can be collected offline to learn problem-solution mappings that accelerate online solving. In this problem-specific setting, simpler learning schemes such as K-nearest neighbors can be effective. This approach has been investigated for MICPs (Zhu and Martius, 2020; Cauligi et al., 2021), and NLPs (Hauser, 2016). These data-driven methods have demonstrated potential in reducing computation time for complex optimization problems. However, considerably less work has been done on applying these data-driven techniques to mixed-integer nonlinear programs (MINLPs) (Dua and Pistikopoulos, 1999), particularly in comparing different reformulation strategies.
In this paper, we investigate data-driven techniques for accelerating mixed-integer bilinear programs (MIBLPs), a common class of MINLPs, for robotics applications. Rather than developing an entirely new approach, we examine how to reformulate MIBLPs into alternative, more tractable formulations such as MICPs or NLPs, then apply simple-yet-effective KNN-based data-driven methods. We systematically compare the performance of these reformulations in terms of solving speed, success rate, solution quality, and data requirements—a comparative perspective that has received limited attention in prior works.
Several established approaches to reformulate MIBLPs exists in the literature. In this paper, we investigate two widely-accepted formulations:
1. Turning binary variables into continuous variables with complementarity constraints, transforming the problem into a mathematical program with complementarity constraints (MPCC) (Park and Boyd, 2018). This formulation has been used in previous works such as (Posa et al., 2014). This formulation enables the direct application of data-driven methods such as (Hauser, 2016; Lin et al., 2022).
2. Converting bilinear constraints into mixed-integer linear constraints using McCormick envelopes (Gupte et al., 2013), which transforms the problem into an MICP. This formulation has been used in previous work such as (Dai et al., 2019). This formulation enables the direct application of data-driven methods such as (Zhu and Martius, 2020; Cauligi et al., 2021).
Given these reformulation options, we ask a natural question: Given a certain amount of problems solved offline as warm-start data, which re-formulation can solve the original MIBLP faster and more reliably? How much data is needed to achieve such performances?
To answer these questions, we conduct a comparison of different MIBLP reformulations in the context of robotic motion planning and control. We experiment with motion planning problems formulated as MINLPs on two models: a linear inverted pendulum with soft contact walls, and a single rigid body with contact. The inverted pendulum problem allows us to study the performance of reformulations in a lower-dimensional setting, while the single rigid body control problem with contact represents a more complex scenario commonly seen in legged locomotion (Ding et al., 2022) and manipulation (Hogan and Rodriguez, 2020) tasks. In this paper, we use the single rigid body model to study a locomotion problem where the robot must change its mobility mode from biped to quadruped to overcome an obstacle, as illustrated in Figure 1. For each problem, we apply data-driven methods to both the MPCC and MICP reformulations, comparing their success rate, computational speed, and solution quality as the amount of training data increases.
Figure 1. Experimental demonstration: SCALER robot transitions from bipedal to quadrupedal form to navigate beneath an obstacle, then returns to bipedal form to reach an elevated target position (illustrated by a button-pushing task).
To summarize, our contributions are as follows:
1. Formulate two robotics control problems (inverted pendulum and single rigid body with mode transitions and contacts) as MIBLPs and solve them with data-driven methods.
2. Benchmark the data-driven performance of different MIBLP re-formulations across these robotic problems.
A preliminary version of this work was published (Lin et al., 2024). While the conference paper introduced the comparative study of MICP and NLP reformulations for MIBLPs, it was limited to the simplified Book Placement Planning problem without dynamics. The current paper significantly extends this work by evaluating the reformulation strategies on more challenging robotic systems with complex dynamics, including an inverted pendulum and single rigid body with contacts.
2 Related works
2.1 Parametric programming
The field of parametric programming focuses on constructing mappings from optimization problem parameters to their corresponding solutions (Fiacco, 1983; Pistikopoulos et al., 2002). This approach has been explored across various optimization domains, including linear programming (Gal, 2010), quadratic programming (Bemporad et al., 2002), and mixed-integer nonlinear programming (Dua and Pistikopoulos, 1999). A notable application of parametric programming appears in controller design through explicit Model Predictive Control (MPC) (Bemporad et al., 2002; Tøndel et al., 2003), which involves solving MPC problems offline and storing the resulting active constraint sets. During online computation, the current system parameters are used to identify the appropriate pre-computed solution. Bemporad et al. (2002) demonstrated that for constrained linear quadratic regulator problems, these active sets form polyhedrons. Dua and Pistikopoulos (2000) further proposed efficient parameter space partitioning algorithms to construct those polyhedrons.
For non-convex optimization problems, where active sets generally do not form polyhedrons, researchers have developed different approaches. Several studies (Hauser, 2016; Tang and Hauser, 2019; Zhu and Martius, 2020) employ non-parametric learning techniques such as K-nearest neighbor (KNN) to directly leverage stored solution data for generating warm-start solutions online. A more recent research has expanded these methods by incorporating neural networks to learn embeddings that map from larger parameter sets to optimization solutions (Cauligi et al., 2021). A significant advantage of neural network methods is their capability to generalize to out-of-distribution problem instances that lie considerably outside the training dataset (Power and Berenson, 2022).
2.2 Mixed-integer convex programs
Mixed-integer convex programs (MICPs) integrate both continuous and discrete decision variables within an optimization framework. Their formal mathematical definition hinges on the property that if all binary variables are relaxed (i.e., replaced by continuous variables between 0 and 1), the resulting problem must be convex (Conforti et al., 2014). Standard MICP solvers such as Gurobi leverage branch-and-bound methods (Boyd and Mattingley, 2007) and cutting plane techniques (Marchand et al., 2002). Although MICPs face exponential worst-case computational complexity with respect to the number of binary variables, practical experience reveals that many real-world problems can be solved by exploring only a small portion of the potential search tree (Williams, 2013). This has facilitated the deployment of mixed-integer approaches in time-sensitive robotics applications such as online motion planning (Tordesillas et al., 2019). Nevertheless, as problem complexity increases, solution times can still extend to minutes or even hours (Lin et al., 2021), presenting significant challenges for real-time implementation.
2.3 Mathematical programming with complementary constraints
Mathematical programs with complementarity constraints (MPCCs) offer an alternative approach to handling discrete variables by reformulating them as continuous variables with additional complementarity conditions. These complementarity constraints enforce that when one variable takes a non-zero value, the other must be exactly zero. This formulation, while compact, presents significant computational challenges because it violates the majority of Constraint Qualifications established for standard nonlinear optimization (Luo et al., 1996). To address these difficulties, researchers have developed specialized algorithms including time-stepping methods (Anitescu and Potra, 1997), pivoting algorithms (Drumwright, 2015), and central path approaches (Kojima et al., 1991). In robotics applications, complementarity constraints have proven valuable for trajectory optimization across different locomotion gaits and implicit contact dynamics (Yunt and Glocker, 2006; Posa et al., 2014; Zhang et al., 2021; Le Cleac’h et al., 2024), where they capture the complementary relationship between contact forces and surface distances. Complementarity constraints can also be used to model binary decision variables (Park and Boyd, 2018).
3 Materials and methods
3.1 Data-driven parametric MIBLPs
Consider a set of optimization problems parameterized by
where
As introduced in Section 1, we compare the data-driven performance of two reformulation approaches: converting problem (1) into either an MICP or an MPCC, and solving these reformulations with data-driven methods. For the MICP reformulation, when a learning algorithm can predict binary variable assignments based on problem parameters
To implement these data-driven approaches, we first construct a dataset of size
We then use this dataset to train models that can either predict binary variables for the MICP reformulation, or generate initial guesses for all variables for the MPCC reformulation. The MICP approach only requires binary variable prediction since fixing these variables yields a convex programming problem that can be solved efficiently without warm-starting. Formally, we seek to approximate the problem-solution map
3.1.1 MPCC Re-formulation
One approach to reformulating problem (1) involves converting all binary variables into continuous variables with complementarity constraints, transforming (1) into MPCCs that can be solved using NLP solvers. In this reformulation, we remove constraint (1d) and instead impose the equivalent complementarity constraint proposed by Park and Boyd (2018):
where
Using our constructed dataset, we implement the approach proposed by Hauser (2016) which utilizes a K-nearest-neighbor (KNN) algorithm to generate multiple candidate initial guesses
3.1.2 MICP Re-formulation
Another reformulation approach converts bilinear constraints into mixed-integer linear constraints. The key idea is to partition the ranges of the variables in each constraint (1e) into small intervals, then approximate the constraint within each interval using the linear McCormick envelope relaxation (McCormick, 1976). For the bilinear constraint
We then use the Special Ordered Set of type 2 (SOS2) formulation introduced by Vielma and Nemhauser (2011). This approach introduces additional continuous variables
We refer readers to Vielma and Nemhauser (2011) for more details on SOS2 enforcement using binary variables. Briefly speaking, SOS2 constraint ensures that only two consecutive values in a vector can be non-zero, and these non-zero values must sum to 1. When applied to
For our data-driven approach, we use offline solved problems to train the learner. During online execution, the learner provides a complete set of binary variables that both fixes the binary variables in (1d) and identifies the appropriate intervals for
3.2 Experimental problem formulations
In this section, we present two problems with two different robotic models that will serve as benchmarks for evaluating the data-driven MIBLP reformulation approaches described in Section 3.1. The first model is a linear inverted pendulum with soft contact walls. The second model is a single rigid body with mode switching and contact constraints. For each problem, we detail how they can be formulated as MIBLPs following the structure of problem (1).
3.2.1 Linear inverted pendulum with soft contacts
For this problem, we model a linear inverted pendulum on a cart that can make contact with soft walls on either side. This example is a standard benchmark problem used to verify linear controllers with discrete contacts (Aydinoglu et al., 2023; Marcucci and Tedrake, 2020; Cauligi et al., 2021; Quirynen and Di Cairano, 2023).
The state vector is
We use the standard linearized dynamics for the inverted pendulum around its unstable equilibrium point (Marcucci and Tedrake, 2020). When the pendulum tip makes contact with walls on either side, contact forces are generated according to a soft-contact model. We implement this using two binary variables indicating contact with the left or right wall at a given time, respectively. The contact force is modeled as a linear spring with contact forces proportional to the penetration depth.
This problem introduces binary variables, but does not introduce bilinear constraints. Nevertheless, it still fits the MIBLP framework given by (1) and therefore serves as a valid benchmark problem here. In this formulation, the parameter
3.2.2 Single rigid body with mode switching and contacts
For this problem, we adopt the standard single rigid body dynamics formulation (Di Carlo et al., 2018), including translational and rotational dynamics. However, MIBLP formulation (1) allows arbitrary rotation angles and can model contact decisions as binary variables, avoiding the small angle approximation and pre-planned contacts used in Di Carlo et al. (2018). This enhanced formulation enables the planning of more aggressive and versatile motions, including transitions between quadrupedal and bipedal locomotion modes, as demonstrated in our experiments.
The continuous variables in this single rigid body formulation include position of geometric center of body
And the transformation from Euler angle rates to angular velocities
For the remaining dynamics equations, discretization using Euler forward integration, friction cone constraints, and leg workspace approximation as a box-shaped region, we follow the approach in Di Carlo et al. (2018).
For each bilinear term that appears in the dynamics, rotation matrix, and angular velocity computations, we augment the optimization variable
The robot is assumed to walk on flat terrain. For contact modeling, we introduce binary variables
On the other hand, when the leg is in contact
where
To enable multi-modal locomotion, we define two distinct modes: quadrupedal and bipedal locomotion, represented by a binary variable
These mode-dependent behaviors are enforced through the following constraints. For front legs
Which enforces periodic contact only in quadrupedal mode. For back legs
To enforce the upright orientation in bipedal mode, we add the constraint:
where
The toe position constraints are:
These constraints implement the mode-dependent toe lifting behavior: in quadrupedal mode, legs alternate between ground contact (
The complete formulation incorporates standard single rigid body dynamics with rotational kinematics described by Equations 4, 5, and leg feasible kinematic region is approximated using box constraints, following (Di Carlo et al., 2018). Contact modeling constraints (Equations 6, 7) enforce the complementary relationship between contact forces and toe heights. Mode-dependent locomotion behaviors are captured through contact sequence constraints (Equation 8) for front legs, body orientation requirements (Equation 9) for bipedal mode, and toe position constraints (Equation 10) that implement mode-dependent prescribed lift heights. The objective function consists of multiple weighted terms to achieve stable locomotion and reach the target position. The primary goal is to minimize the squared distance between the body center and target position
The complete formulation results in an MIBLP of the form given in Equation 1. In this formulation, the parameter
4 Results and discussion
4.1 Experiment on linear inverted pendulum with contacts
For our first benchmark problem, we implemented a linearized cart-pole system with soft contact walls on either side. The system consists of a cart with mass
To build our dataset for data-driven acceleration, we created a dataset with 50 and 500 samples distributed across the state space. We generated solutions for a grid of initial conditions, varying cart position:
For testing, we generated 200 random initial conditions different from the training set but drawn from the same distribution. These solutions were stored and used to warm-start both reformulation approaches. For the MPCC reformulation, we use IPOPT (Wachter, 2002) as the NLP solver for the complete formulation with complementary constraints. For the MICP reformulation, we use Gurobi as the convex optimization solver after fixing the binary variables.
Results Table 1 shows the computational performance comparison between different solving approaches. The “No Initial Guess MPCC” column represents solving the MPCC formulation with default zero initialization. The “Manual Initial Guess MPCC” column represents solving MPCC with a straight-line trajectory initialization that linearly interpolates all state variables from the initial state to the final upright equilibrium, with all control inputs and binary variables initialized to zero. The “Enhanced Manual Initial Guess MPCC” column represents solving MPCC with a physics-informed warm-start: if the pole is angling toward a wall, the trajectory assumes the pole tip will first move toward the wall at constant velocity, remain in contact for a specified duration, then move away from the wall at constant velocity; otherwise, it uses the straight-line interpolation. Our data-driven approaches included “Data-driven MPCC” using KNN to generate initial guesses for the MPCC formulation, and “Data-driven MICP” using KNN to predict binary variables for the MICP reformulation.
We evaluated the success rate, solving time, and solution quality for each method. The “Amount of data” row indicates the number of offline-solved problem instances in the dataset (zero for non-data-driven methods). The “Success rate” row shows the percentage of test problems solved successfully: for manual initialization methods, this represents a single solving attempt; for data-driven methods, the format
Analysis First, MPCC with default zero initialization completely fails to find feasible solutions (0% success rate), while manual straight-line initialization achieves moderate success (42.5%). The enhanced manual initialization achieves a higher success rate of 58.0%. However, designing such heuristics requires problem-specific insight and manual effort that may not scale to more complex robotic systems with higher-dimensional state spaces. The data-driven MPCC approach significantly improves performance, achieving a 76% success rate with just 50 data samples. The data-driven MICP formulation achieves a lower success rate (56%) with the same amount of data. When increasing the dataset to 500 samples, both approaches improve further, with MPCC reaching a 90% success rate and MICP improving to 71.0%. The MICP approach offers nearly a 100
It is important to note that the relaxation parameter
4.2 Experiment on single rigid body with contacts
For this experiment, we evaluate our MIBLP motion planner on a quadrupedal robot model. We use the Spine-enhanced Climbing Autonomous Legged Exploration Robot (SCALER) for demonstration, and implement an SRB model that matches the robot’s physical properties. The total mass is set to 10.0
To showcase the advantages of our MIBLP formulation, which can plan trajectories without pre-assigned gait sequences or small angle approximations, we design a challenging scenario. The environment contains a low-clearance obstacle positioned at height
Figure 2. Solving time distribution for data-driven MPCC. (A) Cart-pole with 50 samples. (B) SRB with contacts using 30 samples.
We sampled 30 and 80 initial conditions with horizontal positions spanning a 1-m range and initial orientation varying within
For testing, we generated 100 random initial conditions that were different from the training dataset and used them to evaluate both the MPCC and MICP formulations. Higher accuracy McCormick envelopes are used in these MICP tests to ensure accurate approximation of the bilinear constraints, with the number of segmentations given in Table 2 (column “testing”).
Results Table 3 shows the computational performances of different solving methods. “Manual Initial Guess MPCC” uses a common straight-line trajectory to connect the initial and goal configurations as the initial guess for the MPCC solver. All binary variables are initialized to 0.5 to provide a neutral starting point without biasing toward any particular contact mode. “Two-stage MICP-MPCC” refers to the two-stage approach used for data collection described previously: first solving a coarse MICP formulation, then refining through MPCC with the MICP solution as warm-start. “Data-driven MPCC” uses our KNN approach to generate initial guesses for the MPCC formulation, while “Data-driven MICP” uses KNN to predict binary variables for the MICP reformulation.
Table 3. Performance comparison of different solving methods for SRB with mode transitions and contacts.
Analysis We observe that both data-driven approaches benefit substantially from warm-starting with the collected dataset. Manual initialization achieves a 0% success rate, failing to find feasible solutions for any test cases. The two-stage MICP-MPCC approach used for data collection demonstrates feasibility but requires
Given the same amount of data, MPCC shows higher success rates, matching the trend observed in the cart-pole experiment where MPCC also outperformed MICP with limited training data. For computational speed, MICP demonstrates a significant advantage, solving problems in approximately
4.3 Discussion on experiment results
According to our experiments on both the linear inverted pendulum and single rigid body control problems, data-driven methods significantly improve the performance of both MPCC and MICP reformulations. As we increased the amount of data, we observed a general trend of improvement in success rates for both approaches. The key distinction between MPCC and MICP reformulations lies in the solving speeds and the amount of data used to achieve similar success rates.
For both the linear inverted pendulum problem and the single rigid body problem, the MPCC formulation achieves higher success rates than the MICP formulation with the same amount of data. We noticed that IPOPT rarely modifies the initial binary guesses for
On the other hand, the MICP formulation demonstrates significantly faster solving speeds when provided with sufficient data. This advantage likely stems from the efficiency of modern convex optimization solvers. For MPCC problems solved with IPOPT, even when we relax the complementarity constraint (2) to values between 0.005 and 0.01 and provide quality warm-start solutions, the solver frequently encounters numerical difficulties and shows slow convergence. The complementarity constraint (2) remains inherently challenging to satisfy. Notably, for the linear inverted pendulum with contact, directly solving using MICP with Gurobi can achieve global optimality at 300 Hz without any data-driven acceleration (Lin, 2024). In addition, numerical accuracy remains a concern with the MPCC approach. Although we only relax constraint (2) to values between 0.005 and 0.01, less than 1% of the binary values, these small relaxations can still lead to accuracy issues in certain cases.
The amount of training data required for effective data-driven MIBLP acceleration depends on the dimensionality of
Therefore, despite the benefits of data-driven acceleration for solving optimization problems, different formulations exhibit distinct performance characteristics and dataset requirements. The choice of problem formulation is crucial for maximizing the effectiveness of data-driven optimization methods.
5 Conclusion
In this paper, we present a comparative study of different reformulation strategies for solving MIBLPs aided by data for robot motion planning applications. We conducted numerical experiments on two motion planning problems for a linear inverted pendulum with contact walls and for quadrupedal locomotion involving quadruped-bipedal mode transformations.
While this work focuses on reformulating the MIBLP problem into more tractable problem types such that existing methods can directly apply, future work can focus on developing innovative data-driven methods tailored specifically to MIBLPs.
Data availability statement
The original contributions presented in the study are included in the article/supplementary material, further inquiries can be directed to the corresponding author.
Author contributions
XL: Writing – original draft, Writing – review and editing.
Funding
The author(s) declare that no financial support was received for the research and/or publication of this article.
Acknowledgements
The author thanks Yusuke Tanaka for providing materials and assistance with figure preparation and visualization.
Conflict of interest
The author declares that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The author(s) declare that no Generative AI was used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
Anitescu, M., and Potra, F. A. (1997). Formulating dynamic multi-rigid-body contact problems with friction as solvable linear complementarity problems. Nonlinear Dyn. 14, 231–247. doi:10.1023/a:1008292328909
Aydinoglu, A., Wei, A., and Posa, M. (2023). Consensus complementarity control for multi-contact mpc. arXiv Prepr. arXiv:2304. doi:10.1109/TRO.2024.3435423
Banerjee, S., Cauligi, A., and Pavone, M. (2025). Deep learning warm starts for trajectory optimization on the international space station
Bemporad, A., Morari, M., Dua, V., and Pistikopoulos, E. N. (2002). The explicit linear quadratic regulator for constrained systems. Automatica 38, 3–20. doi:10.1016/s0005-1098(01)00174-1
Briden, J., Choi, C., Yun, K., Linares, R., and Cauligi, A. (2025). Constraint-informed learning for warm-starting trajectory optimization. J. Guid. Control, Dyn. 48, 2272–2287. doi:10.2514/1.g008791
Carvalho, J., Le, A. T., Baierl, M., Koert, D., and Peters, J. (2023). Motion planning diffusion: learning and planning of robot motions with diffusion models in 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (IEEE),1916–1923.
Cauligi, A., Culbertson, P., Schmerling, E., Schwager, M., Stellato, B., and Pavone, M. (2021). Coco: online mixed-integer control via supervised learning. IEEE Robotics Automation Lett. 7, 1447–1454. doi:10.1109/lra.2021.3135931
Conforti, M., Cornuéjols, G., Zambelli, G., Conforti, M., Cornuéjols, G., and Zambelli, G. (2014). Integer programming models. Springer.
Dai, H., Valenzuela, A., and Tedrake, R. (2014). “Whole-body motion planning with centroidal dynamics and full kinematics,” in IEEE RAS Int. Conf. on Humanoid Robots, 295–302. doi:10.1109/humanoids.2014.7041375
Dai, H., Izatt, G., and Tedrake, R. (2019). Global inverse kinematics via mixed-integer convex optimization. Int. J. Robotics Res. 38, 1420–1441. doi:10.1177/0278364919846512
Dalal, M., Yang, J., Mendonca, R., Khaky, Y., Salakhutdinov, R., and Pathak, D. (2024). Neural mp: a generalist neural motion planner. arXiv Prepr. arXiv:2409.05864. doi:10.48550/arXiv.2409.05864
Deits, R., and Tedrake, R. (2014). “Footstep planning on uneven terrain with mixed-integer convex optimization,” in 2014 IEEE-RAS international conference on humanoid robots (IEEE), 279–286.
Di Carlo, J., Wensing, P. M., Katz, B., Bledt, G., and Kim, S. (2018). “Dynamic locomotion in the mit cheetah 3 through convex model-predictive control,” in 2018 IEEE/RSJ international conference on intelligent robots and systems (IROS) (IEEE), 1–9.
Ding, Y., Khazoom, C., Chignoli, M., and Kim, S. (2022). “Orientation-aware model predictive control with footstep adaptation for dynamic humanoid walking,” in 2022 IEEE-RAS 21st International Conference on Humanoid Robots (Humanoids) (IEEE), 299–305.
Drumwright, E. (2015). Rapidly computable viscous friction and no-slip rigid contact models. arXiv Prepr. arXiv:1504.00719. doi:10.48550/arXiv.1504.00719
Dua, V., and Pistikopoulos, E. N. (1999). Algorithms for the solution of multiparametric mixed-integer nonlinear optimization problems. Industrial and Eng. Chem. Res. 38, 3976–3987. doi:10.1021/ie980792u
Dua, V., and Pistikopoulos, E. N. (2000). An algorithm for the solution of multiparametric mixed integer linear programming problems. Ann. Operations Res. 99, 123–139. doi:10.1023/a:1019241000636
Fiacco, A. V. (1983). Ntroduction to sensitivity and stability analysis in nonlinear programming I, 165. Academic Press.
Fischetti, M., and Monaci, M. (2020). A branch-and-cut algorithm for mixed-integer bilinear programming. Eur. J. Operational Res. 282, 506–514. doi:10.1016/j.ejor.2019.09.043
Gal, T. (2010). Postoptimal analyses in Parametric Programming, and Related Topics: degeneracy, multicriteria decision making, redundancy (Walter de Gruyter).
Geoffrion, A. M. (1972). Generalized benders decomposition. J. Optim. Theory Appl. 10, 237–260. doi:10.1007/bf00934810
Gupte, A., Ahmed, S., Cheon, M. S., and Dey, S. (2013). Solving mixed integer bilinear problems using milp formulations. SIAM J. Optim. 23, 721–744. doi:10.1137/110836183
Hauser, K. (2016). Learning the problem-optimum map: analysis and application to global optimization in robotics. IEEE Trans. Robotics 33, 141–152. doi:10.1109/tro.2016.2623345
Hogan, F. R., and Rodriguez, A. (2020). Reactive planar non-prehensile manipulation with hybrid model predictive control. Int. J. Robotics Res. 39, 755–773. doi:10.1177/0278364920913938
Kojima, M., Megiddo, N., Noma, T., and Yoshise, A. (1991). A unified approach to interior point algorithms for linear complementarity problems: a summary. Operations Res. Lett. 10, 247–254. doi:10.1016/0167-6377(91)90010-m
Le Cleac’h, S., Howell, T. A., Yang, S., Lee, C.-Y., Zhang, J., Bishop, A., et al. (2024). Fast contact-implicit model predictive control. IEEE Transactions on Robotics 40, 1617–1629. doi:10.1109/TRO.2024.3351554
Li, A., Ding, Z., Dieng, A. B., and Beeson, R. (2024). “Constraint-aware diffusion models for trajectory optimization,” in International Conference on Dynamic Data Driven Applications Systems (Springer), 308–316.
Lin, X. (2024). Accelerate hybrid model predictive control using generalized benders decomposition. arXiv Prepr. arXiv:2406.00780. doi:10.48550/arXiv.2406.00780
Lin, X., Zhang, J., Shen, J., Fernandez, G., and Hong, D. W. (2019). “Optimization based motion planning for multi-limbed vertical climbing robots,” in 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (IEEE), 1918–1925.
Lin, X., Ahn, M. S., and Hong, D. (2021). “Designing multi-stage coupled convex programming with data-driven mccormick envelope relaxations for motion planning,” in 2021 IEEE/RSJ International Conference on Robotics and Automation (ICRA) (IEEE).
Lin, X., Fernandez, G. I., and Hong, D. W. (2022). “Reduce: reformulation of mixed integer programs using data from unsupervised clusters for learning efficient strategies,” in 2022 International Conference on Robotics and Automation (ICRA) (IEEE), 4459–4465.
Lin, X., Fernandez, G. I., and Hong, D. W. (2024). “Evaluating data-driven performances of mixed integer bilinear formulations for book placement planning,” in 2024 21st International Conference on Ubiquitous Robots (UR) (IEEE), 482–490.
Luo, Z.-Q., Pang, J.-S., and Ralph, D. (1996). Mathematical programs with equilibrium constraints. Cambridge University Press.
Marchand, H., Martin, A., Weismantel, R., and Wolsey, L. (2002). Cutting planes in integer and mixed integer. Discrete Optim. State Art 11, 397. doi:10.1016/S0166-218X(01)00348-1
Marcucci, T. (2024). Graphs of convex sets with applications to optimal control and motion planning. Cambridge, MA: Massachusetts Institute of Technology.
Marcucci, T., and Tedrake, R. (2020). Warm start of mixed-integer programs for model predictive control of hybrid systems. IEEE Trans. Automatic Control 66, 2433–2448. doi:10.1109/tac.2020.3007688
McCormick, G. P. (1976). Computability of global solutions to factorable nonconvex programs: part i—convex underestimating problems. Math. Program. 10, 147–175. doi:10.1007/bf01580665
Nair, V., Bartunov, S., Gimeno, F., von Glehn, I., Lichocki, P., Lobov, I., et al. (2020). Solving mixed integer programs using neural networks. arXiv Prepr. arXiv:2012.13349. doi:10.48550/arXiv.2012.13349
Park, J., and Boyd, S. (2018). A semidefinite programming method for integer convex quadratic minimization. Optim. Lett. 12, 499–518. doi:10.1007/s11590-017-1132-y
Pistikopoulos, E. N., Dua, V., Bozinis, N. A., Bemporad, A., and Morari, M. (2002). On-line optimization via off-line parametric optimization tools. Comput. and Chem. Eng. 26, 175–185. doi:10.1016/s0098-1354(01)00739-6
Posa, M., Cantu, C., and Tedrake, R. (2014). A direct method for trajectory optimization of rigid bodies through contact. Int. J. Robotics Res. 33, 69–81. doi:10.1177/0278364913506757
Power, T., and Berenson, D. (2022). Variational inference mpc using normalizing flows and out-of-distribution projection. arXiv Prepr. arXiv:2205.04667. doi:10.48550/arXiv.2205.04667
Quirynen, R., and Di Cairano, S. (2023). Tailored presolve techniques in branch-and-bound method for fast mixed-integer optimal control applications. Optim. Control Appl. Methods 44, 3139–3167. doi:10.1002/oca.3030
Schperberg, A., Shirai, Y., Lin, X., Tanaka, Y., and Hong, D. (2023). “Adaptive force controller for contact-rich robotic systems using an unscented kalman filter,” in 2023 IEEE-RAS 22nd International Conference on Humanoid Robots (Humanoids) (IEEE), 1–8.
Shirai, Y., Lin, X., Tanaka, Y., Mehta, A., and Hong, D. (2020). Risk-aware motion planning for a limbed robot with stochastic gripping forces using nonlinear programming. IEEE Robotics Automation Lett. 5, 4994–5001. doi:10.1109/lra.2020.3001503
Stein, O., Oldenburg, J., and Marquardt, W. (2004). Continuous reformulations of discrete–continuous optimization problems. Comput. and Chem. Eng. 28, 1951–1966. doi:10.1016/j.compchemeng.2004.03.011
Tanaka, Y., Shirai, Y., Lin, X., Schperberg, A., Kato, H., Swerdlow, A., et al. (2022). “Scaler: a tough versatile quadruped free-climber robot,” in 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (IEEE), 5632–5639.
Tang, G., and Hauser, K. (2019). A data-driven indirect method for nonlinear optimal control. Astrodynamics 3, 345–359. doi:10.1007/s42064-019-0051-3
Tang, Y., Agrawal, S., and Faenza, Y. (2020). “Reinforcement learning for integer programming: learning to cut,” in International Conference on Machine Learning (PMLR), 9367–9376.
Tøndel, P., Johansen, T. A., and Bemporad, A. (2003). An algorithm for multi-parametric quadratic programming and explicit mpc solutions. Automatica 39, 489–497. doi:10.1016/s0005-1098(02)00250-9
Tordesillas, J., Lopez, B. T., and How, J. P. (2019). Faster: fast and safe trajectory planner for flights in unknown environments in 2019 IEEE/RSJ international conference on intelligent robots and systems (IROS) (IEEE), 1934–1940.
Vielma, J. P., and Nemhauser, G. L. (2011). Modeling disjunctive constraints with a logarithmic number of binary variables and constraints. Math. Program. 128, 49–72. doi:10.1007/s10107-009-0295-4
Wachter, A. (2002). An interior point algorithm for large-scale nonlinear optimization with applications in process engineering. Carnegie Mellon University.
Winkler, A. W., Bellicoso, C. D., Hutter, M., and Buchli, J. (2018). Gait and trajectory optimization for legged systems through phase-based end-effector parameterization. IEEE Robotics Automation Lett. 3, 1560–1567. doi:10.1109/lra.2018.2798285
Yunt, K., and Glocker, C. (2006). “Trajectory optimization of mechanical hybrid systems using sumt,” in 9th IEEE International Workshop on Advanced Motion Control, 2006 (IEEE), 665–671.
Zhang, J., Lin, X., and Hong, D. W. (2021). “Transition motion planning for multi-limbed vertical climbing robots using complementarity constraints,” in 2021 IEEE International Conference on Robotics and Automation (ICRA) (IEEE), 2033–2039.
Keywords: trajectory optimization, robot motion planning, optimal control, mixed-integer bilinear programs, mathematical programs with complementarity constraints
Citation: Lin X (2026) Data-driven acceleration of mixed-integer bilinear programs: a comparative study for robot motion planning. Front. Robot. AI 12:1656564. doi: 10.3389/frobt.2025.1656564
Received: 30 June 2025; Accepted: 27 October 2025;
Published: 08 January 2026.
Edited by:
Yanran Ding, University of Michigan, United StatesReviewed by:
Roohollah Barzamini, Islamic Azad University Central Tehran Branch, IranAntony Thomas, International Institute of Information Technology, Hyderabad, India
Copyright © 2026 Lin. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Xuan Lin, bWF5bmlnaHRAdWNsYS5lZHU=