pith. sign in

arxiv: 0812.0933 · v1 · submitted 2008-12-04 · 💻 cs.LG · cs.CC

Decision trees are PAC-learnable from most product distributions: a smoothed analysis

classification 💻 cs.LG cs.CC
keywords decisionproducttreesdistributionalgorithmconsiderexamplesrandom
0
0 comments X
read the original abstract

We consider the problem of PAC-learning decision trees, i.e., learning a decision tree over the n-dimensional hypercube from independent random labeled examples. Despite significant effort, no polynomial-time algorithm is known for learning polynomial-sized decision trees (even trees of any super-constant size), even when examples are assumed to be drawn from the uniform distribution on {0,1}^n. We give an algorithm that learns arbitrary polynomial-sized decision trees for {\em most product distributions}. In particular, consider a random product distribution where the bias of each bit is chosen independently and uniformly from, say, [.49,.51]. Then with high probability over the parameters of the product distribution and the random examples drawn from it, the algorithm will learn any tree. More generally, in the spirit of smoothed analysis, we consider an arbitrary product distribution whose parameters are specified only up to a [-c,c] accuracy (perturbation), for an arbitrarily small positive constant c.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Decision Tree Learning on Product Spaces

    cs.LG 2026-05 unverdicted novelty 7.0

    For any function computable by an optimal decision tree with size s, max depth D_opt and average depth Δ_opt, the greedy heuristic builds an ε-approximating tree of size at most exp(Δ_opt D_opt log(e/ε)) under arbitra...

  2. ID3 Learns Juntas for Smoothed Product Distributions

    cs.LG 2019-06 unverdicted novelty 6.0

    ID3 learns log n-juntas in polynomial time under the smoothed analysis model for product distributions.