pith. sign in

arxiv: 2606.07664 · v1 · pith:2JTPEYNHnew · submitted 2026-06-03 · 💻 cs.NE · cs.AI

Seq103: A Unified Neuroevolution Framework for Compact Sequence Architecture Discovery

Pith reviewed 2026-06-28 02:30 UTC · model grok-4.3

classification 💻 cs.NE cs.AI
keywords neuroevolutionneural architecture searchsequence classificationcompact modelstime series classificationtext classificationrecurrent networks
0
0 comments X

The pith

Seq103 evolves compact sequence models that retain 82-87% of baseline accuracy with up to 160,000 times fewer parameters.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

Seq103 presents a unified neuroevolution framework that evolves both network topology and weights for sequence classification. A shared backbone handles the core search for all tasks, while an optional hidden-state extension adds temporal memory only when needed for recurrent step-wise inference. The same pipeline therefore covers both recurrent and feedforward settings without redesign. On text and time-series benchmarks the evolved models keep most of the accuracy of much larger baselines while using dramatically fewer parameters. This matters because it demonstrates that a single evolutionary process can produce practical, deployable sequence classifiers across different inference modes.

Core claim

Seq103 consists of a shared evolutionary backbone and an optional recurrent extension. The backbone uses an elementary node-and-connection representation, per-class RMSE-based evaluation, mutation-based evolution with class-wise recombination, and elitism. The hidden-state extension adds hidden nodes and connections when step-wise recurrent inference is required. With the hidden-state mechanism enabled for recurrent tasks and disabled for feedforward tasks, the same core search produces compact architectures that retain 86.96% of best-baseline accuracy on average for step-wise tasks using 34.6x to 3218.0x fewer parameters and 81.95% for sample-wise tasks over UCRArchive2018 using 11.8x to 16

What carries the argument

The shared evolutionary backbone with elementary node-and-connection representation, per-class RMSE evaluation, mutation-based evolution with class-wise recombination, and elitism, plus optional hidden-state nodes for temporal memory.

Load-bearing premise

The evolutionary search with the elementary node-and-connection representation, per-class RMSE evaluation, mutation-based evolution with class-wise recombination, and elitism will reliably produce architectures that generalize to held-out test data across the reported benchmarks.

What would settle it

Seq103 search run on additional sequence datasets where the evolved models fail to retain at least 80% of best-baseline accuracy while using at least 10 times fewer parameters would falsify the central performance claim.

Figures

Figures reproduced from arXiv: 2606.07664 by Qing Xie, Wenxiao Li, Yongjian Liu.

Figure 1
Figure 1. Figure 1: Per-dataset accuracy behavior of Seq103 over all 128 UCR datasets. [PITH_FULL_IMAGE:figures/full_fig_p012_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Critical difference diagram of Seq103 and the baseline methods on the 128 UCR datasets. [PITH_FULL_IMAGE:figures/full_fig_p013_2.png] view at source ↗
read the original abstract

Neuroevolution is a representative neural architecture search paradigm that evolves both network topology and weights through evolutionary algorithms. In this paper, we propose Seq103, a unified NEAT-style neuroevolution framework for compact sequence architecture discovery. Seq103 consists of a shared evolutionary backbone and an optional recurrent extension. The shared backbone includes an elementary node-and-connection representation, per-class RMSE-based evaluation, mutation-based evolution with class-wise recombination, and elitism. The optional hidden-state mechanism extends the search space with hidden-state nodes and hidden connections, enabling temporal memory when step-wise recurrent inference is required. With this design, Seq103 applies the same core search pipeline to both step-wise recurrent and sample-wise feedforward sequence classification. In recurrent tasks, the hidden-state extension is enabled to provide temporal memory; in feedforward tasks, it is disabled while the shared evolutionary backbone remains unchanged. We evaluate Seq103 on 8 text classification datasets and the full UCRArchive2018 benchmark with 128 univariate time-series datasets. On step-wise tasks, Seq103 retains 86.96% of the best-baseline accuracy on average while using 34.6x to 3218.0x fewer parameters. On sample-wise tasks over the full UCRArchive2018 benchmark, Seq103 retains 81.95% of the best-baseline accuracy on average while using 11.8x to 160,601.0x fewer parameters.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 0 minor

Summary. The manuscript presents Seq103 as a unified neuroevolution framework extending NEAT for compact sequence architecture discovery. It maintains a shared backbone consisting of an elementary node-and-connection genome representation, per-class RMSE fitness evaluation, mutation-based evolution incorporating class-wise recombination, and elitism. An optional hidden-state mechanism allows extension to recurrent inference for step-wise tasks. The same pipeline is used for both recurrent (with hidden states enabled) and feedforward (disabled) sequence classification. Claims are supported by evaluations on 8 text classification datasets for step-wise tasks, retaining on average 86.96% of the best baseline accuracy with 34.6x to 3218.0x fewer parameters, and on the full UCRArchive2018 benchmark of 128 univariate time-series datasets for sample-wise tasks, retaining 81.95% accuracy with 11.8x to 160,601.0x fewer parameters.

Significance. If the empirical results hold under rigorous validation, the work would offer a notable contribution to neuroevolution by demonstrating a single search pipeline that achieves extreme parameter reduction across both recurrent and feedforward sequence tasks while retaining substantial accuracy. The scale of evaluation on the full UCRArchive2018 is a strength, and the unification via optional hidden-state extension is conceptually clean. However, the significance is currently limited by the absence of variance, statistical testing, and split details needed to confirm generalization.

major comments (2)
  1. [Abstract] Abstract: The headline retention rates (86.96% on step-wise tasks and 81.95% on sample-wise tasks) and the associated parameter reduction ranges are stated without any reference to the number of independent evolutionary runs performed, standard deviations across runs, or statistical significance tests against baselines. This is load-bearing for the central claim because neuroevolution results are known to exhibit high variance; without these, it is impossible to determine whether the reported compactness-accuracy tradeoff reflects reliable discovery or run-specific outcomes.
  2. [Abstract] Abstract (per-class RMSE-based evaluation and evolutionary process): The fitness function is described as per-class RMSE with mutation-based evolution and elitism, but no information is provided on whether fitness evaluation uses a validation split strictly isolated from the final held-out test sets, or on the train/validation/test partitioning protocol. This directly affects the weakest assumption that the search produces architectures that generalize, as overlap or leakage between fitness data and test data would invalidate the generalization claims.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive feedback on result reporting and experimental protocol. We address each major comment below and will revise the manuscript accordingly to improve clarity and rigor.

read point-by-point responses
  1. Referee: [Abstract] Abstract: The headline retention rates (86.96% on step-wise tasks and 81.95% on sample-wise tasks) and the associated parameter reduction ranges are stated without any reference to the number of independent evolutionary runs performed, standard deviations across runs, or statistical significance tests against baselines. This is load-bearing for the central claim because neuroevolution results are known to exhibit high variance; without these, it is impossible to determine whether the reported compactness-accuracy tradeoff reflects reliable discovery or run-specific outcomes.

    Authors: We agree that neuroevolution exhibits high variance and that the absence of run counts, standard deviations, and statistical tests weakens the central claims. The manuscript reports only aggregate retention rates without these details. We will revise the abstract and results sections to specify the number of independent evolutionary runs, include standard deviations, and report statistical significance tests against baselines. revision: yes

  2. Referee: [Abstract] Abstract (per-class RMSE-based evaluation and evolutionary process): The fitness function is described as per-class RMSE with mutation-based evolution and elitism, but no information is provided on whether fitness evaluation uses a validation split strictly isolated from the final held-out test sets, or on the train/validation/test partitioning protocol. This directly affects the weakest assumption that the search produces architectures that generalize, as overlap or leakage between fitness data and test data would invalidate the generalization claims.

    Authors: We acknowledge that explicit details on data partitioning are required to support generalization claims. The manuscript does not currently describe the train/validation/test protocol or confirm isolation of fitness data from test sets. We will add a clear description of the partitioning protocol in the revised manuscript, specifying that per-class RMSE fitness uses training data with a strictly held-out test set and no leakage. revision: yes

Circularity Check

0 steps flagged

No circularity: purely empirical claims with no derivations or self-referential fits

full rationale

The manuscript describes a neuroevolution method and reports benchmark retention rates (86.96% and 81.95% of baseline accuracy) together with parameter-reduction factors. These are presented as measured experimental outcomes on held-out test sets from UCRArchive2018 and text-classification corpora. No equations, uniqueness theorems, ansatzes, or parameter-fitting steps appear in the abstract or surrounding context; the evolutionary search components (node-connection genome, per-class RMSE, mutation, elitism) are described as design choices whose performance is evaluated directly rather than derived from prior results by the same authors. Because the central claims rest on external benchmark measurements rather than any reduction to fitted inputs or self-citations, the derivation chain contains no circular steps.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract provides no explicit free parameters, axioms, or invented entities; assessment limited by lack of full text.

pith-pipeline@v0.9.1-grok · 5814 in / 1053 out tokens · 72134 ms · 2026-06-28T02:30:10.057894+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

45 extracted references · 2 canonical work pages · 1 internal anchor

  1. [1]

    and Ng, Andrew and Potts, Christopher

    Socher, Richard and Perelygin, Alex and Wu, Jean and Chuang, Jason and Manning, Christopher D. and Ng, Andrew and Potts, Christopher. Recursive Deep Models for Semantic Compositionality Over a Sentiment Treebank. Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing. 2013

  2. [2]

    and Daly, Raymond E

    Maas, Andrew L. and Daly, Raymond E. and Pham, Peter T. and Huang, Dan and Ng, Andrew Y. and Potts, Christopher , title =. Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies , month =. 2011 , address =

  3. [3]

    Bo Pang and Lillian Lee , title =

  4. [4]

    NIPS , year=

    Character-level Convolutional Networks for Text Classification , author=. NIPS , year=

  5. [5]

    Automated Learning of Decision Rules for Text Categorization , journal =

    Chidanand Apt. Automated Learning of Decision Rules for Text Categorization , journal =

  6. [6]

    Demszky, Dorottya and Movshovitz-Attias, Dana and Ko, Jeongwoo and Cowen, Alan and Nemade, Gaurav and Ravi, Sujith , booktitle =

  7. [8]

    Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence (IJCAI 2016) , pages =

    Liu, Pengfei and Qiu, Xipeng and Huang, Xuanjing , title =. Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence (IJCAI 2016) , pages =

  8. [9]

    Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP) , pages =

    Kim, Yoon , title =. Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP) , pages =

  9. [10]

    Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies , pages =

    Yang, Zichao and Yang, Diyi and Dyer, Chris and He, Xiaodong and Smola, Alex and Hovy, Eduard , title =. Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies , pages =

  10. [11]

    Vaswani, Ashish and Shazeer, Noam and Parmar, Niki and Uszkoreit, Jakob and Jones, Llion and Gomez, Aidan N. and. Attention Is All You Need , booktitle =

  11. [12]

    Keras-TextClassification , author =

  12. [13]

    Data Mining and Knowledge Discovery , Year =

    Deep learning for time series classification: a review , Author =. Data Mining and Knowledge Discovery , Year =

  13. [14]

    2018 , month =

    The UCR Time Series Classification Archive , author =. 2018 , month =

  14. [15]

    2017 International Joint Conference on Neural Networks (IJCNN) , pages =

    Wang, Zhiguang and Yan, Weizhong and Oates, Tim , title =. 2017 International Joint Conference on Neural Networks (IJCNN) , pages =

  15. [16]

    and Weber, Jonathan and Webb, Geoffrey I

    Ismail Fawaz, Hassan and Lucas, Benjamin and Forestier, Germain and Pelletier, Charlotte and Schmidt, Daniel F. and Weber, Jonathan and Webb, Geoffrey I. and Petitjean, Fran. InceptionTime: Finding AlexNet for Time Series Classification , journal =

  16. [17]

    Data Mining and Knowledge Discovery , volume =

    Bagnall, Anthony and Lines, Jason and Bostrom, Aaron and Large, James and Keogh, Eamonn , title =. Data Mining and Knowledge Discovery , volume =

  17. [18]

    ECML/PKDD Workshop on Advanced Analytics and Learning on Temporal Data , year =

    Le Guennec, Arthur and Malinowski, Simon and Tavenard, Romain , title =. ECML/PKDD Workshop on Advanced Analytics and Learning on Temporal Data , year =

  18. [19]

    Leon , title =

    Zheng, Yi and Liu, Qi and Chen, Enhong and Ge, Yong and Zhao, J. Leon , title =. International Conference on Web-Age Information Management , pages =

  19. [20]

    Towards a Universal Neural Network Encoder for Time Series , booktitle =

    Serr. Towards a Universal Neural Network Encoder for Time Series , booktitle =

  20. [21]

    and Lu, H

    Zhao, B. and Lu, H. and Chen, S. and Liu, J. and Wu, D. , title =. Journal of Systems Engineering and Electronics , volume =

  21. [22]

    Data Mining and Knowledge Discovery , volume =

    Ismail Fawaz, Hassan and Forestier, Germain and Weber, Jonathan and Idoumghar, Lhassane and Muller, Pierre-Alain , title =. Data Mining and Knowledge Discovery , volume =

  22. [24]

    and Miikkulainen, Risto , title =

    Stanley, Kenneth O. and Miikkulainen, Risto , title =. Evolutionary Computation , volume =

  23. [25]

    , author Damerau, F

    author Apt \'e , C. , author Damerau, F. , author Weiss, S.M. , year 1994 . title Automated learning of decision rules for text categorization . journal ACM Transactions on Information Systems

  24. [26]

    , author Lines, J

    author Bagnall, A. , author Lines, J. , author Bostrom, A. , author Large, J. , author Keogh, E. , year 2017 . title The great time series classification bake off: A review and experimental evaluation of recent algorithmic advances . journal Data Mining and Knowledge Discovery volume 31 , pages 606--660

  25. [27]

    , author Keogh, E

    author Dau, H.A. , author Keogh, E. , author Kamgar, K. , author Yeh, C.C.M. , author Zhu, Y. , author Gharghabi, S. , author Ratanamahatana, C.A. , author Yanping , author Hu, B. , author Begum, N. , author Bagnall, A. , author Mueen, A. , author Batista, G. , author Hexagon-ML , year 2018 . title The ucr time series classification archive . note https:/...

  26. [28]

    , author Movshovitz-Attias, D

    author Demszky, D. , author Movshovitz-Attias, D. , author Ko, J. , author Cowen, A. , author Nemade, G. , author Ravi, S. , year 2020 . title GoEmotions: A Dataset of Fine-Grained Emotions , in: booktitle 58th Annual Meeting of the Association for Computational Linguistics (ACL)

  27. [29]

    , author Forestier, G

    author Ismail Fawaz, H. , author Forestier, G. , author Weber, J. , author Idoumghar, L. , author Muller, P.A. , year 2019 . title Deep learning for time series classification: a review . journal Data Mining and Knowledge Discovery volume 33 , pages 917--963

  28. [30]

    , author Lucas, B

    author Ismail Fawaz, H. , author Lucas, B. , author Forestier, G. , author Pelletier, C. , author Schmidt, D.F. , author Weber, J. , author Webb, G.I. , author Petitjean, F. , author Muller, P.A. , year 2020 . title Inceptiontime: Finding alexnet for time series classification . journal Data Mining and Knowledge Discovery volume 34 , pages 1936--1962

  29. [31]

    Bag of Tricks for Efficient Text Classification

    author Joulin, A. , author Grave, E. , author Bojanowski, P. , author Mikolov, T. , year 2016 . title Bag of tricks for efficient text classification . journal arXiv preprint arXiv:1607.01759

  30. [32]

    , year 2014

    author Kim, Y. , year 2014 . title Convolutional neural networks for sentence classification , in: booktitle Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP) , pp. pages 1746--1751

  31. [33]

    , author Malinowski, S

    author Le Guennec, A. , author Malinowski, S. , author Tavenard, R. , year 2016 . title Data augmentation for time series classification using convolutional neural networks , in: booktitle ECML/PKDD Workshop on Advanced Analytics and Learning on Temporal Data

  32. [34]

    , author Qiu, X

    author Liu, P. , author Qiu, X. , author Huang, X. , year 2016 . title Recurrent neural network for text classification with multi-task learning , in: booktitle Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence (IJCAI 2016) , pp. pages 2873--2879

  33. [35]

    , author Daly, R.E

    author Maas, A.L. , author Daly, R.E. , author Pham, P.T. , author Huang, D. , author Ng, A.Y. , author Potts, C. , year 2011 . title Learning word vectors for sentiment analysis , in: booktitle Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies , publisher Association for Computational Lin...

  34. [36]

    , author Kallada, M

    author McIntyre, A. , author Kallada, M. , author Miguel, C.G. , author Feher de Silva, C. , author Netto, M.L. , . title neat-python . https://github.com/CodeReclaimers/neat-python, :10.5281/zenodo.19024753

  35. [37]

    , year 2019

    author Mo, Y. , year 2019 . title Keras-textclassification . howpublished https://github.com/yongzhuo/Keras-TextClassification

  36. [38]

    , author Lee, L

    author Pang, B. , author Lee, L. , year 2005 . title Seeing stars: Exploiting class relationships for sentiment categorization with respect to rating scales , in: booktitle Proceedings of the ACL

  37. [39]

    , author Pascual, S

    author Serr \`a , J. , author Pascual, S. , author Karatzoglou, A. , year 2018 . title Towards a universal neural network encoder for time series , in: booktitle Artificial Intelligence Research and Development , pp. pages 120--129

  38. [40]

    , author Perelygin, A

    author Socher, R. , author Perelygin, A. , author Wu, J. , author Chuang, J. , author Manning, C.D. , author Ng, A. , author Potts, C. , year 2013 . title Recursive deep models for semantic compositionality over a sentiment treebank , in: booktitle Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing , publisher Associati...

  39. [41]

    , author Miikkulainen, R

    author Stanley, K.O. , author Miikkulainen, R. , year 2002 . title Evolving neural networks through augmenting topologies . journal Evolutionary Computation volume 10 , pages 99--127

  40. [42]

    , author Shazeer, N

    author Vaswani, A. , author Shazeer, N. , author Parmar, N. , author Uszkoreit, J. , author Jones, L. , author Gomez, A.N. , author Kaiser, . , author Polosukhin, I. , year 2017 . title Attention is all you need , in: booktitle Advances in Neural Information Processing Systems 30 , pp. pages 5998--6008

  41. [43]

    , author Yan, W

    author Wang, Z. , author Yan, W. , author Oates, T. , year 2017 . title Time series classification from scratch with deep neural networks: A strong baseline , in: booktitle 2017 International Joint Conference on Neural Networks (IJCNN) , pp. pages 1578--1585

  42. [44]

    , author Yang, D

    author Yang, Z. , author Yang, D. , author Dyer, C. , author He, X. , author Smola, A. , author Hovy, E. , year 2016 . title Hierarchical attention networks for document classification , in: booktitle Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies , pp. pages 1...

  43. [45]

    , author Zhao, J.J

    author Zhang, X. , author Zhao, J.J. , author LeCun, Y. , year 2015 . title Character-level convolutional networks for text classification , in: booktitle NIPS

  44. [46]

    , author Lu, H

    author Zhao, B. , author Lu, H. , author Chen, S. , author Liu, J. , author Wu, D. , year 2017 . title Convolutional neural networks for time series classification . journal Journal of Systems Engineering and Electronics volume 28 , pages 162--169

  45. [47]

    , author Liu, Q

    author Zheng, Y. , author Liu, Q. , author Chen, E. , author Ge, Y. , author Zhao, J.L. , year 2014 . title Time series classification using multi-channels deep convolutional neural networks , in: booktitle International Conference on Web-Age Information Management , pp. pages 298--310