Improved Learning Dynamics and Model Compactness
in Symbolic Regression
Lachlan Stewart (presenting)1 · Gorka Silva2 · Leonardo Trujillo3 · Illya Bakurov4 · Mauro Castelli5 · Davide Farinati5 · Jose Manuel Muñoz Contreras3 · Leonardo Vanneschi5
1 Australian National University · 2 Universidad Complutense de Madrid · 3 Tecnológico Nacional de México · 4 Michigan State University · 5 NOVA IMS, Universidade Nova de Lisboa
Université Toulouse Capitole, Rempart Building
Toulouse, France · 9 April 2026
A 2025 SPECIES Summer School Production
Deep Learning · Ensembles
Accurate but opaque
Genetic Programming
Interpretable but hard to search
Can we get both?
Vanneschi, L. (2024). SLIM_GSGP: The Non-bloating Geometric Semantic Genetic Programming. In Genetic Programming, Giacobini, Xue & Manzoni (Eds.). Springer Nature Switzerland, 125–141.
ABS mutation:
each component bounded in (
Moraglio, A., Krawiec, K. & Johnson, C.G. (2012). Geometric Semantic Genetic Programming. In Parallel Problem Solving from Nature — PPSN XII, Coello Coello et al. (Eds.). LNCS Vol. 7491. Springer Berlin Heidelberg, 21–31.
But each GSGP mutation adds syntax — models grow at every step ↑
Vanneschi, L. (2024). SLIM_GSGP: The Non-bloating Geometric Semantic Genetic Programming. In Genetic Programming, Giacobini, Xue & Manzoni (Eds.). Springer Nature Switzerland, 125–141.
Model can grow and shrink during evolution
Vanneschi, L. (2024). SLIM_GSGP: The Non-bloating Geometric Semantic Genetic Programming. In Genetic Programming, Giacobini, Xue & Manzoni (Eds.). Springer Nature Switzerland, 125–141.
Clip extreme values · zero out negligible ones → mutation cancelled → implicit size reduction
Ivo Gonçalves, Sara Silva, and Carlos M. Fonseca. 2015. On the Generalization Ability of Geometric Semantic Genetic Programming. In Genetic Programming, Penousal Machado, Malcolm I. Heywood, James McDermott, Mauro Castelli, Pablo García-Sánchez, Paolo Burelli, Sebastian Risi, and Kevin Sim (Eds.). Springer International Publishing, Cham, 41–52.
James McDermott, Alexandros Agapitos, Anthony Brabazon, and Michael O’Neill. 2014. Geometric semantic genetic programming for financial data. In Applications of Evolutionary Computation: 17th European Conference, EvoApplications 2014, Granada, Spain, April 23-25, 2014, Revised Selected Papers 17. Springer, 215–226.
Fit slope
OMS: local step optimality · LS: global calibration · Complementary
Maarten Keijzer. 2003. Improving Symbolic Regression with Interval Arithmetic and Linear Scaling. In Genetic Programming, Conor Ryan, Terence Soule, Maarten Keijzer, Edward Tsang, Riccardo Poli, and Ernesto Costa (Eds.). Springer Berlin Heidelberg, Berlin, Heidelberg, 70–82
Giorgia Nadizar, Fraser Garrow, Berfin Sakallioglu, Lorenzo Canonne, Sara Silva, and Leonardo Vanneschi. 2023. An Investigation of Geometric Semantic GP with Linear Scaling. In Proceedings of the Genetic and Evolutionary Computation Conference (Lisbon, Portugal) (GECCO ’23). Association for Computing Machinery, New York, NY, USA, 1165–1174. doi:10.1145/3583131.3590418
* BF, OC, BS are used for selection from the final population, not during each evolutionary step
Works best when combined with LS and OMS
Edwin D. de Jong and Jordan B. Pollack. 2003. Multi-Objective Methods for Tree Size Control. Genetic Programming and Evolvable Machines 4, 3 (Sept. 2003), 211–233. doi:10.1023/a:1025122906870
Mark Kotanchek, Guido Smits, and Ekaterina Vladislavleva. 2007. Pursuing the Pareto Paradigm: Tournaments, Algorithm Variations and Ordinal Optimization. Springer US, 167–185. doi:10.1007/978-0-387-49650-4_11
Guido F. Smits and Mark Kotanchek. 2005. Pareto-Front Exploitation in Symbolic Regression. Springer US, Boston, MA, 283–299. doi:10.1007/0-387-23254-0_17
Applied to Pareto-front models before final selection.
Modest effect alone — ABS and SIG functions resist simplification. Amplifies OMS: zeroed mutation steps get cleaned up.
we studied the
baseline SLIM
we studied the addition
of each to BASE
we studied the combination
of all additions
we studied the subtraction
of each from ALL
This was the most efficient way to understand how each addition affects the accuracy and size of models, resulting in 10 variants.
Datasets — same as the original SLIM GSGP paper
How we measured
Four simple, well-known enhancements combine synergistically to evolve more accurate and dramatically smaller models
Co-Authors
Gorka Silva1 · Lachlan Stewart 2 · Leonardo Trujillo3 · Illya Bakurov4 · Mauro Castelli5 · Davide Farinati5 · Jose Manuel Muñoz Contreras3 · Leonardo Vanneschi5
1 Universidad Complutense de Madrid · 2 Australian National University · 3 Tecnológico Nacional de México · 4 Michigan State University · 5 NOVA IMS, Universidade Nova de Lisboa
Special thanks to Leonardo Trujillo for outstanding mentorship throughout the 2025 SPECIES Summer School and beyond.
Thank you to SPECIES and its organisers for putting on the conference and the Summer School. Apply for 2026! species-society.org/summer-school-2026
EuroGP 2026 · EvoStar · Toulouse, France