arxiv: 2205.15569 · v2 · pith:HHCEIZFOnew · submitted 2022-05-31 · 💻 cs.LG · cs.AI · cs.NE

GSR: A Generalized Symbolic Regression Approach

classification 💻 cs.LG cs.AI cs.NE
keywords approach, benchmark, problem, regression, relationships, symbolic, challenging, dataset

Identifying the mathematical relationships that best describe a dataset, a task known as Symbolic Regression (SR), remains a very challenging problem in machine learning. In contrast to neural networks, which are often treated as black boxes, SR attempts to gain insight into the underlying relationships between the independent variables and the target variable of a given dataset by assembling analytical functions. In this paper, we present GSR, a Generalized Symbolic Regression approach, which modifies the conventional SR optimization problem formulation while keeping the main SR objective intact. In GSR, we infer mathematical relationships between the independent variables and some transformation of the target variable. We constrain our search space to a weighted sum of basis functions, and propose a genetic programming approach with a matrix-based encoding scheme. We show that our GSR method is competitive with strong SR benchmark methods, achieving promising experimental performance on well-known SR benchmark problem sets. Finally, we highlight the strengths of GSR by introducing SymSet, a new SR benchmark set that is more challenging than the existing benchmarks.
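
To make the idea of relating the independent variables to a transformation of the target more concrete, the following is a minimal sketch and not the paper's implementation. It assumes a fixed, hand-picked set of basis functions and a hand-picked target transformation (here `np.log`), and it fits the weights with ordinary least squares; the abstract does not say how the weights are fitted, and the genetic programming search over basis functions and its matrix-based encoding are omitted entirely. The function name `fit_weighted_basis` and the synthetic data are hypothetical, introduced only for illustration.

```python
import numpy as np


def fit_weighted_basis(x, y, basis_functions, target_transform):
    """Least-squares fit of target_transform(y) ~ sum_i w_i * phi_i(x).

    Assumption: ordinary least squares stands in for whatever weight-fitting
    procedure the paper uses; the basis functions are given, not searched for.
    """
    # Each column of the design matrix is one basis function evaluated on x.
    design = np.column_stack([phi(x) for phi in basis_functions])
    weights, *_ = np.linalg.lstsq(design, target_transform(y), rcond=None)
    return weights


# Synthetic data (hypothetical): y = exp(2x + 0.5x^2).
# y is not a weighted sum of {1, x, x^2}, but log(y) is.
rng = np.random.default_rng(0)
x = rng.uniform(-1.0, 1.0, size=200)
y = np.exp(2.0 * x + 0.5 * x**2)

basis = [
    lambda x: np.ones_like(x),  # constant term
    lambda x: x,                # linear term
    lambda x: x**2,             # quadratic term
]

weights = fit_weighted_basis(x, y, basis, np.log)
print(weights)  # weights for the constant, linear, and quadratic terms
```

In this toy case the fitted weights come out close to 0, 2, and 0.5, so the recovered relationship is log(y) = 2x + 0.5x², which is simpler to express than any direct weighted-sum model of y itself. That is the kind of benefit a transformed target can offer, under the assumptions stated above.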

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Data Enrichment for Symbolic Regression Using Diffusion Models

    cs.LG 2026-05 unverdicted novelty 5.0

    A variational autoencoder plus conditional latent diffusion model with a physics-informed residual corrector generates synthetic fields that improve symbolic regression recovery on sparse heat conduction, Navier-Stoke...