Clusters are All You Need: Pre-Training the Tsetlin Machine with Semantic Clusters from Language Models for Interpretability

Jiechao Gao; Jie Wang; Michael Lepech; Rohan Kumar Yadav; Ying Liu; Yuandong Pan; Yuangang Li

arxiv: 2606.19815 · v1 · pith:BGY5ZSMYnew · submitted 2026-06-18 · 💻 cs.CL


Jiechao Gao, Rohan Kumar Yadav, Yuangang Li, Yuandong Pan, Jie Wang, Ying Liu, Michael Lepech

classification 💻 cs.CL

keywords semantic, clusters, interpretable, language, BERT, embeddings, machine, models


Pre-trained language models such as BERT achieve strong text classification performance but lack transparency, limiting their use in high-stakes settings. The Tsetlin Machine (TM) offers fully interpretable, clause-based reasoning but captures little semantic information, and prior attempts to bridge the two rely on static word embeddings that miss contextual meaning. We propose a semantic pre-training framework that transfers knowledge from a pre-trained language model into a TM without using embeddings. Text samples are grouped into semantically coherent clusters with K-means or Top2Vec, and the resulting cluster-sample pairs pre-train a non-negated TM with enhanced Type I feedback. The TM thereby learns interpretable semantic keywords that are fine-tuned on downstream tasks. Across five datasets, our method substantially outperforms vanilla and embedding-based TMs and reaches performance competitive with BERT while remaining interpretable.
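The clustering step described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: it assumes each text already has a numeric vector from some language model (the toy 2-D `vectors` below are made up), uses a plain K-means with a deterministic farthest-point initialisation, and binarises text as bag-of-words presence bits, which is an assumed input encoding for a non-negated TM. The helper names (`kmeans`, `binarize`, `build_pretraining_pairs`) are hypothetical. The vectors are used only to form clusters; the TM side sees only Boolean word features paired with a cluster id.

```python
from typing import List, Sequence, Tuple


def squared_distance(a: Sequence[float], b: Sequence[float]) -> float:
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(points: List[List[float]], k: int, iterations: int = 20) -> List[int]:
    """Lloyd's algorithm with deterministic farthest-point initialisation."""
    centroids = [list(points[0])]
    while len(centroids) < k:
        # Next seed: the point farthest from its nearest existing centroid.
        farthest = max(
            points, key=lambda p: min(squared_distance(p, c) for c in centroids)
        )
        centroids.append(list(farthest))

    assignment = [0] * len(points)
    for _ in range(iterations):
        # Assign every point to its closest centroid.
        assignment = [
            min(range(k), key=lambda j: squared_distance(p, centroids[j]))
            for p in points
        ]
        # Move each centroid to the mean of its members.
        for j in range(k):
            members = [p for p, a in zip(points, assignment) if a == j]
            if members:
                centroids[j] = [sum(col) / len(members) for col in zip(*members)]
    return assignment


def binarize(text: str, vocabulary: List[str]) -> List[int]:
    """Bag-of-words presence bits (assumed Boolean input encoding)."""
    tokens = set(text.lower().split())
    return [1 if word in tokens else 0 for word in vocabulary]


def build_pretraining_pairs(
    texts: List[str], vectors: List[List[float]], k: int
) -> Tuple[List[str], List[Tuple[List[int], int]]]:
    """Pair each binarised sample with its cluster id as a pseudo-label."""
    vocabulary = sorted({tok for t in texts for tok in t.lower().split()})
    cluster_ids = kmeans(vectors, k)
    pairs = [(binarize(t, vocabulary), c) for t, c in zip(texts, cluster_ids)]
    return vocabulary, pairs


# Toy data: the vectors stand in for language-model representations.
texts = [
    "the match ended in a draw",
    "the striker scored a goal",
    "the stock market fell",
    "investors sold shares",
]
vectors = [[0.9, 0.1], [0.8, 0.2], [0.1, 0.9], [0.2, 0.8]]

vocabulary, pairs = build_pretraining_pairs(texts, vectors, k=2)
for text, (bits, cluster_id) in zip(texts, pairs):
    print(cluster_id, text)
```

In this sketch the cluster id plays the role of a class label during pre-training, so clauses learned for a cluster would be built from the words that co-occur in it. The pre-training itself, including the enhanced Type I feedback, is not shown here.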
