pith. sign in

arxiv: 2605.26473 · v1 · pith:WEZ6WGOCnew · submitted 2026-05-26 · 📡 eess.SY · cs.SY

Orion: Enabling Self-adaptive Memory Management for On-device Online Continual Learning

Pith reviewed 2026-06-29 16:23 UTC · model grok-4.3

classification 📡 eess.SY cs.SY
keywords online continual learningmemory managementon-device learningself-adaptive systemsrobotic applicationsbuckets effectreplay bufferstraining latency
0
0 comments X

The pith

Orion uses a buckets-effect runtime indicator to dynamically reallocate memory across online continual learning components for on-device robotic deployment.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

Online continual learning must adapt to new data in real time on memory-limited devices such as robots, but fixed memory allocations create unacceptable trade-offs in speed, plasticity, and stability. Orion supplies a self-adaptive framework that continuously redistributes memory among batch processing, replay buffers, and optimizers at both operating-system and application layers. The mechanism rests on URGE, an indicator that identifies the current scarcest resource and triggers coordinated reallocation plus prefetching. Prototype tests across multiple algorithms, benchmarks, and robotic hardware show faster training with preserved balance and low added cost. Integration into an actual autonomous navigation robot confirms practical feasibility under changing workloads.

Core claim

Orion is a holistic framework that leverages URGE, a unified runtime indicator grounded in the Buckets effect, to dynamically reallocate memory across OCL components by jointly coordinating batch processing, replay buffers, and optimization strategies at both the OS and application level, together with system-level data prefetching, thereby enabling feasible on-device deployment while achieving significant training speedups with minimal runtime, memory, and energy overhead.

What carries the argument

URGE, a unified runtime indicator grounded in the Buckets effect principle that system performance is bounded by its scarcest resource, which dynamically coordinates memory reallocation across OCL components.

If this is right

  • State-of-the-art OCL algorithms become deployable on memory-constrained robotic hardware without offline retuning.
  • Training latency decreases while plasticity-stability balance is preserved across changing data distributions.
  • Memory pressure shifts during learning are handled automatically rather than through manual intervention.
  • System-level prefetching further reduces I/O stalls without extra application changes.
  • The same prototype integrates with Avalanche-lib and runs on standard autonomous-robot platforms.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • Similar indicator-driven reallocation could be tested on other edge devices such as drones or wearable sensors facing variable memory pressure.
  • The buckets-effect logic may generalize to joint management of compute, bandwidth, and energy in multi-task on-device learning.
  • Developers could explore whether URGE-style indicators improve non-continual on-device training under tight memory caps.

Load-bearing premise

URGE can reliably coordinate memory reallocation across OCL components at OS and application levels without introducing instability or unacceptable overhead in real robotic workloads.

What would settle it

A controlled run on the target robotic platform in which URGE-driven reallocation produces either measurable instability, increased energy draw, or slower convergence compared with static allocation under identical data streams.

Figures

Figures reproduced from arXiv: 2605.26473 by Cong Liu, Nikil Dutt, Zexin Li.

Figure 1
Figure 1. Figure 1: No silver bullet. Visualization of the complex tradeoff space among three key performance metrics of on-device OCL. Memory-agnostic co-optimizing training latency, plasticity, and stability could easily raise out-of-memory (OOM) concerns under stringent memory constraints. resource-constrained platforms [2]. Memory selection (GSS) favors informative samples [5]. Constraint￾and regularization-based methods,… view at source ↗
Figure 2
Figure 2. Figure 2: Impact of key hyperparameter choices on OCL metrics under memory constraints. [PITH_FULL_IMAGE:figures/full_fig_p004_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: Design overview of Orion. as URGEt = 1 1 + e kp(Pt−Pth) · 1 1 + e ks(St−Sth) · 1 1 + e−kl(Lt−Lth) · 1 1 + e km(Mt−Mmax) (1) where kp, ks, kl , and km are scaling factors that control the sensitivity of the URGE to changes in plasticity, stability, training latency, and memory pressure, respectively. The terms in the URGE equation (Eq. 1) reflect the system’s state. If plasticity is low, the first term is h… view at source ↗
Figure 4
Figure 4. Figure 4: Overall effectiveness of Orion using the ER algorithm evaluated on five benchmarks. Crosses indicate OOM. 11 [PITH_FULL_IMAGE:figures/full_fig_p011_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: Overall effectiveness of Orion on four OCL algorithms on the CORe50-NC benchmark. Crosses indicate OOM. 5.2 Overall Effectiveness This section evaluates the effectiveness of Orion across three platforms, benchmarks, and OCL algorithms. The evaluation is divided into three subsets: (1) performance across benchmarks using a state-of-the-art OCL algorithm, (2) performance across representative OCL algorithms … view at source ↗
Figure 6
Figure 6. Figure 6: A realistic case study based on an NVIDIA Jetson GPU-enabled autonomous navigation robot [PITH_FULL_IMAGE:figures/full_fig_p016_6.png] view at source ↗
Figure 7
Figure 7. Figure 7: Left: Memory breakdown on GSS algorithm in the case study. The red line represents Xavier’s maximum memory. Right: System-level memory usage profiling for GSS algorithm. The red line represents Xavier’s maximum memory. Red crosses indicate the OOM point. Orion to achieve substantial gains in training latency performance while maintaining acceptable plasticity and stability. 5.5 Overhead Analysis Execution … view at source ↗
Figure 8
Figure 8. Figure 8: Ablation study of data prefetching [PITH_FULL_IMAGE:figures/full_fig_p017_8.png] view at source ↗
read the original abstract

Online continual learning (OCL) enables real-time adaptation to new data, making it crucial for dynamic robotic applications. However, its practical deployment is hindered by memory constraints in resource-limited systems, which affect key trade-offs in training latency, plasticity, and stability. Unlike offline parameter tuning, which cannot account for the dynamic shift in memory pressure and workload complexity as OCL progresses, an online and self-adaptive approach is essential for robust on-device deployment. This paper proposes Orion, a holistic framework designed to co-optimize training latency, plasticity, and stability of state-of-the-art OCL models under strict memory constraints, enabling feasible on-device deployment. At its core, Orion leverages URGE, a unified runtime indicator grounded in the ``Buckets effect'' principle that system performance is bounded by its scarcest resource, to dynamically reallocate memory across OCL components by jointly coordinating batch processing, replay buffers, and optimization strategies at both the OS and application level. Furthermore, Orion introduces system-level data prefetching techniques to maximize efficiency. A system prototype of Orion has been implemented using the widely adopted \texttt{Avalanche-lib} and thoroughly evaluated across a diverse range of OCL algorithms, benchmarks, and hardware platforms commonly used in autonomous robotic applications. To further demonstrate its practical utility, Orion is integrated into a realistic autonomous navigational robot powered by OCL. The results show that Orion achieves significant training speedups while maintaining balanced performance and effectively adapting to various scenarios, all with minimal runtime, memory, and energy overhead, making Orion a practical solution for on-device continual learning.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 1 minor

Summary. The paper proposes Orion, a holistic framework for co-optimizing training latency, plasticity, and stability in on-device online continual learning under memory constraints. It introduces URGE, a unified runtime indicator based on the buckets effect principle, to dynamically reallocate memory by jointly coordinating batch processing, replay buffers, and optimization strategies at both OS and application levels, along with system-level data prefetching. The framework is implemented using Avalanche-lib and evaluated across OCL algorithms, benchmarks, and hardware platforms, with additional integration into an autonomous navigational robot, claiming significant speedups with balanced performance and minimal overhead.

Significance. If the claims hold with independent measurements, Orion would represent a practical advance for on-device OCL in dynamic robotic settings by providing online self-adaptation to varying memory pressure, which static offline tuning cannot achieve. The prototype implementation, evaluation across algorithms/benchmarks/hardware, and real-robot integration are strengths that support applicability.

major comments (2)
  1. [Abstract] Abstract: the central claim that Orion 'achieves significant training speedups while maintaining balanced performance' with 'minimal runtime, memory, and energy overhead' is load-bearing for the practicality assertion, yet no quantitative results, baselines, error bars, or stability metrics (e.g., variance under memory pressure shifts) are supplied, preventing verification that numbers are independent of tuning parameters.
  2. [Evaluation] Evaluation: the assumption that URGE can jointly coordinate reallocation without instability is load-bearing for the self-adaptive claim, but no concrete metrics on robustness (such as plasticity/stability trade-off variance during sudden memory pressure changes in robotic workloads) are reported.
minor comments (1)
  1. [Abstract] The 'Buckets effect' principle is invoked without a citation or short explanation of its mapping to OCL memory components.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive feedback. We address the two major comments below and will revise the manuscript to improve clarity and support for the claims.

read point-by-point responses
  1. Referee: [Abstract] Abstract: the central claim that Orion 'achieves significant training speedups while maintaining balanced performance' with 'minimal runtime, memory, and energy overhead' is load-bearing for the practicality assertion, yet no quantitative results, baselines, error bars, or stability metrics (e.g., variance under memory pressure shifts) are supplied, preventing verification that numbers are independent of tuning parameters.

    Authors: We agree that the abstract would be strengthened by including quantitative highlights. The evaluation sections of the manuscript report detailed results with baselines, multiple runs, error bars, and overhead measurements across algorithms, benchmarks, and hardware. In the revision we will update the abstract to summarize key quantitative outcomes (e.g., observed speedups and overhead ranges) with explicit pointers to the corresponding figures and tables. revision: yes

  2. Referee: [Evaluation] Evaluation: the assumption that URGE can jointly coordinate reallocation without instability is load-bearing for the self-adaptive claim, but no concrete metrics on robustness (such as plasticity/stability trade-off variance during sudden memory pressure changes in robotic workloads) are reported.

    Authors: The referee correctly notes the absence of explicit robustness metrics for sudden memory-pressure shifts. While the current evaluation demonstrates adaptation across robotic workloads and varying conditions, it does not isolate variance in the plasticity/stability trade-off under abrupt pressure changes. We will add these metrics in a revised evaluation section, including targeted analysis or additional runs that quantify the trade-off variance during memory-pressure transitions. revision: yes

Circularity Check

0 steps flagged

No significant circularity; claims rest on independent empirical evaluation

full rationale

The abstract describes Orion as a framework that uses the URGE indicator (grounded in the external 'Buckets effect' principle) to coordinate memory reallocation across OCL components, with results from prototype evaluation on Avalanche-lib across algorithms/benchmarks/hardware and robot integration. No equations, fitted parameters, self-citations, or derivation steps are presented that would reduce claimed speedups, overhead, or stability metrics to quantities defined by the same inputs or by construction. The performance assertions appear as outcomes of system implementation and testing rather than tautological renamings or self-referential fits, making the derivation self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract only; no explicit free parameters, axioms, or invented entities are described. The buckets-effect principle is invoked as grounding for URGE but its precise formalization is not given.

pith-pipeline@v0.9.1-grok · 5820 in / 1154 out tokens · 37544 ms · 2026-06-29T16:23:49.651719+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

78 extracted references · 21 canonical work pages · 4 internal anchors

  1. [1]

    Continual learning with tiny episodic memories

    Arslan Chaudhry, Marcus Rohrbach, Mohamed Elhoseiny, Thalaiyasingam Ajanthan, P Dokania, P Torr, and M Ranzato. Continual learning with tiny episodic memories. InWorkshop on Multi-Task and Lifelong Reinforcement Learning, 2019

  2. [2]

    Computationally budgeted continual learning: What does matter? In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 3698– 3707, 2023

    Ameya Prabhu, Hasan Abed Al Kader Hammoud, Puneet K Dokania, Philip HS Torr, Ser-Nam Lim, Bernard Ghanem, and Adel Bibi. Computationally budgeted continual learning: What does matter? In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 3698– 3707, 2023

  3. [3]

    Gradient episodic memory for continual learning.Ad- vances in neural information processing systems, 30, 2017

    David Lopez-Paz and Marc’Aurelio Ranzato. Gradient episodic memory for continual learning.Ad- vances in neural information processing systems, 30, 2017

  4. [4]

    Efficient Lifelong Learning with A-GEM

    Arslan Chaudhry, Marc’Aurelio Ranzato, Marcus Rohrbach, and Mohamed Elhoseiny. Efficient life- long learning with a-gem.arXiv preprint arXiv:1812.00420, 2018

  5. [5]

    Gradient based sample selection for online continual learning.Advances in neural information processing systems, 32, 2019

    Rahaf Aljundi, Min Lin, Baptiste Goujaud, and Yoshua Bengio. Gradient based sample selection for online continual learning.Advances in neural information processing systems, 32, 2019

  6. [6]

    Overcoming catastrophic forgetting in neural networks.Proceedings of the national academy of sciences, 114(13): 3521–3526, 2017

    James Kirkpatrick, Razvan Pascanu, Neil Rabinowitz, Joel Veness, Guillaume Desjardins, Andrei A Rusu, Kieran Milan, John Quan, Tiago Ramalho, Agnieszka Grabska-Barwinska, et al. Overcoming catastrophic forgetting in neural networks.Proceedings of the national academy of sciences, 114(13): 3521–3526, 2017

  7. [7]

    Learning without forgetting.IEEE transactions on pattern analysis and machine intelligence, 40(12):2935–2947, 2017

    Zhizhong Li and Derek Hoiem. Learning without forgetting.IEEE transactions on pattern analysis and machine intelligence, 40(12):2935–2947, 2017

  8. [8]

    A comprehensive empirical evaluation on online continual learn- ing

    Albin Soutif-Cormerais, Antonio Carta, Andrea Cossu, Julio Hurtado, Vincenzo Lomonaco, Joost Van de Weijer, and Hamed Hemati. A comprehensive empirical evaluation on online continual learn- ing. InProceedings of the IEEE/CVF International Conference on Computer Vision, pages 3518–3528, 2023

  9. [9]

    Online active continual learning for robotic lifelong object recognition.IEEE Transactions on Neural Networks and Learning Systems, 2023

    Xiangli Nie, Zhiguang Deng, Mingdong He, Mingyu Fan, and Zheng Tang. Online active continual learning for robotic lifelong object recognition.IEEE Transactions on Neural Networks and Learning Systems, 2023

  10. [10]

    Covio: Online continual learning for visual-inertial odometry

    Niclas Vödisch, Daniele Cattaneo, Wolfram Burgard, and Abhinav Valada. Covio: Online continual learning for visual-inertial odometry. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 2464–2473, 2023. 19

  11. [11]

    Codeps: Online continual learning for depth estimation and panoptic segmentation.arXiv preprint arXiv:2303.10147, 2023

    Niclas Vödisch, Kürsat Petek, Wolfram Burgard, and Abhinav Valada. Codeps: Online continual learning for depth estimation and panoptic segmentation.arXiv preprint arXiv:2303.10147, 2023

  12. [12]

    From continual learning to causal discovery in robotics

    Luca Castri, Sariah Mghames, and Nicola Bellotto. From continual learning to causal discovery in robotics. InAAAI Bridge Program on Continual Causality, pages 85–91. PMLR, 2023

  13. [13]

    Continual learning for au- tonomous robots: A prototype-based approach.arXiv preprint arXiv:2404.00418, 2024

    Elvin Hajizada, Balachandran Swaminathan, and Yulia Sandamirskaya. Continual learning for au- tonomous robots: A prototype-based approach.arXiv preprint arXiv:2404.00418, 2024

  14. [14]

    Online con- tinual learning in image classification: An empirical survey.Neurocomputing, 469:28–51, 2022

    Zheda Mai, Ruiwen Li, Jihwan Jeong, David Quispe, Hyunwoo Kim, and Scott Sanner. Online con- tinual learning in image classification: An empirical survey.Neurocomputing, 469:28–51, 2022

  15. [15]

    Online continual learning with maximal interfered retrieval.Advances in neural information processing systems, 32, 2019

    Rahaf Aljundi, Eugene Belilovsky, Tinne Tuytelaars, Laurent Charlin, Massimo Caccia, Min Lin, and Lucas Page-Caccia. Online continual learning with maximal interfered retrieval.Advances in neural information processing systems, 32, 2019

  16. [16]

    Real-time evaluation in online continual learn- ing: A new hope

    Yasir Ghunaim, Adel Bibi, Kumail Alhamoud, Motasem Alfarra, Hasan Abed Al Kader Hammoud, Ameya Prabhu, Philip HS Torr, and Bernard Ghanem. Real-time evaluation in online continual learn- ing: A new hope. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 11888–11897, 2023

  17. [17]

    Lifelearner: Hardware-aware meta continual learning system for embedded computing platforms.arXiv preprint arXiv:2311.11420, 2023

    Young D Kwon, Jagmohan Chauhan, Hong Jia, Stylianos I Venieris, and Cecilia Mascolo. Lifelearner: Hardware-aware meta continual learning system for embedded computing platforms.arXiv preprint arXiv:2311.11420, 2023

  18. [18]

    Ekya: Continuous learning of video analytics models on edge compute servers

    Romil Bhardwaj, Zhengxu Xia, Ganesh Ananthanarayanan, Junchen Jiang, Yuanchao Shu, Nikolaos Karianakis, Kevin Hsieh, Paramvir Bahl, and Ion Stoica. Ekya: Continuous learning of video analytics models on edge compute servers. In19th USENIX Symposium on Networked Systems Design and Implementation (NSDI 22), pages 119–135, 2022

  19. [19]

    Latent replay for real-time continual learning

    Lorenzo Pellegrini, Gabriele Graffieti, Vincenzo Lomonaco, and Davide Maltoni. Latent replay for real-time continual learning. In2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pages 10203–10209. IEEE, 2020

  20. [20]

    Saunders Philadelphia, 1971

    Eugene Pleasants Odum, Gary W Barrett, et al.Fundamentals of ecology, volume 3. Saunders Philadelphia, 1971

  21. [21]

    Avalanche: A pytorch library for deep continual learning.Journal of Machine Learning Research, 24(363):1–6, 2023

    Antonio Carta, Lorenzo Pellegrini, Andrea Cossu, Hamed Hemati, and Vincenzo Lomonaco. Avalanche: A pytorch library for deep continual learning.Journal of Machine Learning Research, 24(363):1–6, 2023. URLhttp://jmlr.org/papers/v24/23-0130.html

  22. [22]

    Hayes, Matthias De Lange, Marc Masana, Jary Pomponi, Gido van de Ven, Martin Mundt, Qi She, Keiland Cooper, Jeremy Forest, Eden Belouadah, Simone Calderara, German I

    Vincenzo Lomonaco, Lorenzo Pellegrini, Andrea Cossu, Antonio Carta, Gabriele Graffieti, Tyler L. Hayes, Matthias De Lange, Marc Masana, Jary Pomponi, Gido van de Ven, Martin Mundt, Qi She, Keiland Cooper, Jeremy Forest, Eden Belouadah, Simone Calderara, German I. Parisi, Fabio Cuz- zolin, Andreas Tolias, Simone Scardapane, Luca Antiga, Subutai Amhad, Adri...

  23. [23]

    Learning multiple layers of features from tiny images

    Alex Krizhevsky, Geoffrey Hinton, et al. Learning multiple layers of features from tiny images. 2009

  24. [24]

    Fine-grained continual learning.arXiv preprint arXiv:1907.03799, 1, 2019

    Vincenzo Lomonaco, Davide Maltoni, Lorenzo Pellegrini, et al. Fine-grained continual learning.arXiv preprint arXiv:1907.03799, 1, 2019. 20

  25. [25]

    Core50: a new dataset and benchmark for continual object recogni- tion

    V Lomanco and Davide Maltoni. Core50: a new dataset and benchmark for continual object recogni- tion. InProceedings of the 1st Annual Conference on Robot Learning, pages 17–26, 2017

  26. [26]

    A procedural world generation framework for systematic evaluation of continual learning.arXiv preprint arXiv:2106.02585, 2021

    Timm Hess, Martin Mundt, Iuliia Pliushch, and Visvanathan Ramesh. A procedural world generation framework for systematic evaluation of continual learning.arXiv preprint arXiv:2106.02585, 2021

  27. [27]

    Jetson agx xavier.https://developer.nvidia.com/embedded/ jetson-agx-xavier, 2020

    NVIDIA. Jetson agx xavier.https://developer.nvidia.com/embedded/ jetson-agx-xavier, 2020

  28. [28]

    Jetson agx orin.https://www.nvidia.com/en-us/autonomous-machines/ embedded-systems/jetson-orin/, 2022

    NVIDIA. Jetson agx orin.https://www.nvidia.com/en-us/autonomous-machines/ embedded-systems/jetson-orin/, 2022

  29. [29]

    Person re-identification for robot person following with online continual learning.IEEE Robotics and Automation Letters, 2024

    Hanjing Ye, Jieting Zhao, Yu Zhan, Weinan Chen, Li He, and Hong Zhang. Person re-identification for robot person following with online continual learning.IEEE Robotics and Automation Letters, 2024

  30. [30]

    Incremental learning of retrievable skills for efficient continual task adaptation.Advances in Neural Information Processing Systems, 37:17286–17312, 2024

    Daehee Lee, Minjong Yoo, Woo Kyung Kim, Wonje Choi, and Honguk Woo. Incremental learning of retrievable skills for efficient continual task adaptation.Advances in Neural Information Processing Systems, 37:17286–17312, 2024

  31. [31]

    Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour

    Priya Goyal, Piotr Dollár, Ross Girshick, Pieter Noordhuis, Lukasz Wesolowski, Aapo Kyrola, Andrew Tulloch, Yangqing Jia, and Kaiming He. Accurate, large minibatch sgd: Training imagenet in 1 hour. arXiv preprint arXiv:1706.02677, 2017

  32. [32]

    Autoware on board: Enabling autonomous vehicles with embedded systems

    Shinpei Kato, Shota Tokunaga, Yuya Maruyama, Seiya Maeda, Manato Hirabayashi, Yuki Kitsukawa, Abraham Monrroy, Tomohito Ando, Yusuke Fujii, and Takuya Azumi. Autoware on board: Enabling autonomous vehicles with embedded systems. In2018 ACM/IEEE 9th International Conference on Cyber-Physical Systems (ICCPS), pages 287–296. IEEE, 2018

  33. [33]

    Deep learning for autonomous vehicles

    Branislav Kisa ˇcanin. Deep learning for autonomous vehicles. In2017 IEEE 47th International Sym- posium on Multiple-Valued Logic (ISMVL), pages 142–142. IEEE, 2017

  34. [34]

    Nvradarnet: Real-time radar obstacle and free space detection for autonomous driving.arXiv preprint arXiv:2209.14499, 2022

    Alexander Popov, Patrik Gebhardt, Ke Chen, Ryan Oldja, Heeseok Lee, Shane Murray, Ruchi Bhar- gava, and Nikolai Smolyanskiy. Nvradarnet: Real-time radar obstacle and free space detection for autonomous driving.arXiv preprint arXiv:2209.14499, 2022

  35. [35]

    Duckiebot (db-j).https://get.duckietown.com/products/ duckiebot-db21, 2022

    NVIDIA. Duckiebot (db-j).https://get.duckietown.com/products/ duckiebot-db21, 2022

  36. [36]

    Sparkfun jetbot ai kit.https://www.sparkfun.com/products/18486, 2022

    NVIDIA. Sparkfun jetbot ai kit.https://www.sparkfun.com/products/18486, 2022

  37. [37]

    Waveshare jetbot ai kit.https://www.amazon.com/ Waveshare-JetBot-AI-Kit-Accessories/dp/B07V8JL4TF/, 2022

    NVIDIA. Waveshare jetbot ai kit.https://www.amazon.com/ Waveshare-JetBot-AI-Kit-Accessories/dp/B07V8JL4TF/, 2022

  38. [38]

    Deep residual learning for image recog- nition

    Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. Deep residual learning for image recog- nition. InProceedings of the IEEE conference on computer vision and pattern recognition, pages 770–778, 2016

  39. [39]

    ROBOTIS, 2024

    ROBOTIS.TurtleBot3 Overview. ROBOTIS, 2024. URLhttps://emanual.robotis.com/ docs/en/platform/turtlebot3/overview/. Accessed: 15 April 2024

  40. [40]

    ImageNet: A large- scale hierarchical image database

    Jia Deng, Wei Dong, Richard Socher, Li-Jia Li, Kai Li, and Li Fei-Fei. Imagenet: A large-scale hierarchical image database. In2009 IEEE Conference on Computer Vision and Pattern Recognition, pages 248–255, 2009. doi: 10.1109/CVPR.2009.5206848. 21

  41. [41]

    Cloc: Confident initial estimation of long-term visual localization using a few sequential images in large- scale spaces.IEEE Sensors Journal, 23(8):8613–8629, 2023

    Joohyung Kim, Janghun Hyeon, Hyunga Choi, Bumchul Jang, Bokyeon Jeong, and Nakju Doh. Cloc: Confident initial estimation of long-term visual localization using a few sequential images in large- scale spaces.IEEE Sensors Journal, 23(8):8613–8629, 2023. doi: 10.1109/JSEN.2023.3253872

  42. [42]

    Continual variational autoencoder via continual generative knowledge distillation

    Fei Ye and Adrian G Bors. Continual variational autoencoder via continual generative knowledge distillation. InProceedings of the AAAI Conference on Artificial Intelligence, volume 37, pages 10918– 10926, 2023

  43. [43]

    Continual learning on a diet: Learning from sparsely labeled streams under constrained computation.arXiv preprint arXiv:2404.12766, 2024

    Wenxuan Zhang, Youssef Mohamed, Bernard Ghanem, Philip HS Torr, Adel Bibi, and Mohamed Elhoseiny. Continual learning on a diet: Learning from sparsely labeled streams under constrained computation.arXiv preprint arXiv:2404.12766, 2024

  44. [44]

    A unified architecture for accelerating distributed{DNN}training in heterogeneous{GPU/CPU}clusters

    Yimin Jiang, Yibo Zhu, Chang Lan, Bairen Yi, Yong Cui, and Chuanxiong Guo. A unified architecture for accelerating distributed{DNN}training in heterogeneous{GPU/CPU}clusters. In14th USENIX Symposium on Operating Systems Design and Implementation (OSDI 20), pages 463–479, 2020

  45. [45]

    Varuna: scalable, low-cost training of massive deep learning models

    Sanjith Athlur, Nitika Saran, Muthian Sivathanu, Ramachandran Ramjee, and Nipun Kwatra. Varuna: scalable, low-cost training of massive deep learning models. InProceedings of the Seventeenth Euro- pean Conference on Computer Systems, pages 472–487, 2022

  46. [46]

    Large scale distributed deep networks.Advances in neural information processing systems, 25, 2012

    Jeffrey Dean, Greg Corrado, Rajat Monga, Kai Chen, Matthieu Devin, Mark Mao, Marc’aurelio Ran- zato, Andrew Senior, Paul Tucker, Ke Yang, et al. Large scale distributed deep networks.Advances in neural information processing systems, 25, 2012

  47. [47]

    Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism

    Mohammad Shoeybi, Mostofa Patwary, Raul Puri, Patrick LeGresley, Jared Casper, and Bryan Catan- zaro. Megatron-lm: Training multi-billion parameter language models using model parallelism.arXiv preprint arXiv:1909.08053, 2019

  48. [48]

    Pipefisher: Efficient training of large language models using pipelining and fisher information matrices.Proceedings of Machine Learning and Systems, 5, 2023

    Kazuki Osawa, Shigang Li, and Torsten Hoefler. Pipefisher: Efficient training of large language models using pipelining and fisher information matrices.Proceedings of Machine Learning and Systems, 5, 2023

  49. [49]

    Design and implemen- tation of a lightweight on-device ai-based real-time fault diagnosis system using continual learning

    Youngjun Kim, Taewan Kim, Suhyun Kim, Seongjae Lee, and Taehyoun Kim. Design and implemen- tation of a lightweight on-device ai-based real-time fault diagnosis system using continual learning. IEMEK Journal of Embedded Systems and Applications, 19(3):151–158, 2024

  50. [50]

    Contauth: Continual learning framework for behavioral-based user authentication.Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies, 4(4):1–23, 2020

    Jagmohan Chauhan, Young D Kwon, Pan Hui, and Cecilia Mascolo. Contauth: Continual learning framework for behavioral-based user authentication.Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies, 4(4):1–23, 2020

  51. [51]

    Continual learning for on-device speech recognition using disentangled con- formers

    Anuj Diwan, Ching-Feng Yeh, Wei-Ning Hsu, Paden Tomasello, Eunsol Choi, David Harwath, and Abdelrahman Mohamed. Continual learning for on-device speech recognition using disentangled con- formers. InICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and Signal Pro- cessing (ICASSP), pages 1–5. IEEE, 2023

  52. [52]

    Continual learning at the edge: Real-time training on smartphone devices.arXiv preprint arXiv:2105.13127, 2021

    Lorenzo Pellegrini, Vincenzo Lomonaco, Gabriele Graffieti, and Davide Maltoni. Continual learning at the edge: Real-time training on smartphone devices.arXiv preprint arXiv:2105.13127, 2021

  53. [53]

    Memory-efficient domain incremental learning for in- ternet of things

    Yuqing Zhao, Divya Saxena, and Jiannong Cao. Memory-efficient domain incremental learning for in- ternet of things. InProceedings of the 20th ACM Conference on Embedded Networked Sensor Systems, pages 1175–1181, 2022. 22

  54. [54]

    Online continual learning for human activity recognition.Pervasive and Mobile Computing, 93:101817, 2023

    Martin Schiemer, Lei Fang, Simon Dobson, and Juan Ye. Online continual learning for human activity recognition.Pervasive and Mobile Computing, 93:101817, 2023

  55. [55]

    Mistify: Automating{DNN}model porting for{On-Device} inference at the edge

    Peizhen Guo, Bo Hu, and Wenjun Hu. Mistify: Automating{DNN}model porting for{On-Device} inference at the edge. In18th USENIX Symposium on Networked Systems Design and Implementation (NSDI 21), pages 705–719, 2021

  56. [56]

    Gemel: Model merging for{Memory- Efficient},{Real-Time}video analytics at the edge

    Arthi Padmanabhan, Neil Agarwal, Anand Iyer, Ganesh Ananthanarayanan, Yuanchao Shu, Niko- laos Karianakis, Guoqing Harry Xu, and Ravi Netravali. Gemel: Model merging for{Memory- Efficient},{Real-Time}video analytics at the edge. In20th USENIX Symposium on Networked Systems Design and Implementation (NSDI 23), pages 973–994, 2023

  57. [57]

    Once-for-all: Train one network and specialize it for efficient deployment.arXiv preprint arXiv:1908.09791,

    Han Cai, Chuang Gan, Tianzhe Wang, Zhekai Zhang, and Song Han. Once-for-all: Train one network and specialize it for efficient deployment.arXiv preprint arXiv:1908.09791, 2019

  58. [58]

    Continual prune- and-select: class-incremental learning with specialized subnetworks.Applied Intelligence, 53(14): 17849–17864, 2023

    Aleksandr Dekhovich, David MJ Tax, Marcel HF Sluiter, and Miguel A Bessa. Continual prune- and-select: class-incremental learning with specialized subnetworks.Applied Intelligence, 53(14): 17849–17864, 2023

  59. [59]

    Esacl: Efficient continual learning of sparse models.arXiv preprint arXiv:2401.05667, 2024

    Weijieying Ren and Vasant G Honavar. Esacl: Efficient continual learning of sparse models.arXiv preprint arXiv:2401.05667, 2024

  60. [60]

    Impact patterns of combining model pruning and continual learning on model performance

    Xueyang Zhang, Hang Li, Xi Chen, and Xue Liu. Impact patterns of combining model pruning and continual learning on model performance. In2021 IEEE Third International Conference on Cognitive Machine Intelligence (CogMI), pages 27–33. IEEE, 2021

  61. [61]

    Continual learn- ing for task-oriented dialogue system with iterative network pruning, expanding and masking.arXiv preprint arXiv:2107.08173, 2021

    Binzong Geng, Fajie Yuan, Qiancheng Xu, Ying Shen, Ruifeng Xu, and Min Yang. Continual learn- ing for task-oriented dialogue system with iterative network pruning, expanding and masking.arXiv preprint arXiv:2107.08173, 2021

  62. [62]

    Sparcl: Sparse continual learning on the edge.Advances in Neural Information Processing Systems, 35:20366–20380, 2022

    Zifeng Wang, Zheng Zhan, Yifan Gong, Geng Yuan, Wei Niu, Tong Jian, Bin Ren, Stratis Ioannidis, Yanzhi Wang, and Jennifer Dy. Sparcl: Sparse continual learning on the edge.Advances in Neural Information Processing Systems, 35:20366–20380, 2022

  63. [63]

    Learn it or leave it: module composition and pruning for continual learning.arXiv preprint arXiv:2406.18708, 2024

    Mingyang Wang, Heike Adel, Lukas Lange, Jannik Strötgen, and Hinrich Schütze. Learn it or leave it: module composition and pruning for continual learning.arXiv preprint arXiv:2406.18708, 2024

  64. [64]

    Positcl: Compact continual learning with posit aware quantization

    Vedant Karia, Abdullah Zyarah, and Dhireesha Kudithipudi. Positcl: Compact continual learning with posit aware quantization. InProceedings of the Great Lakes Symposium on VLSI 2024, pages 645–650, 2024

  65. [65]

    Continual learning via bit-level information preserving

    Yujun Shi, Li Yuan, Yunpeng Chen, and Jiashi Feng. Continual learning via bit-level information preserving. InProceedings of the IEEE/CVF conference on Computer Vision and Pattern Recognition, pages 16674–16683, 2021

  66. [66]

    A tinyml platform for on-device continual learning with quantized latent replays.IEEE Journal on Emerging and Selected Topics in Circuits and Systems, 11(4):789–802, 2021

    Leonardo Ravaglia, Manuele Rusci, Davide Nadalini, Alessandro Capotondi, Francesco Conti, and Luca Benini. A tinyml platform for on-device continual learning with quantized latent replays.IEEE Journal on Emerging and Selected Topics in Circuits and Systems, 11(4):789–802, 2021

  67. [67]

    Andreas Griewank and Andrea Walther. Algorithm 799: revolve: an implementation of checkpointing for the reverse or adjoint mode of computational differentiation.ACM Transactions on Mathematical Software (TOMS), 26(1):19–45, 2000. 23

  68. [68]

    Cost- effective on-device continual learning over memory hierarchy with miro

    Xinyue Ma, Suyeon Jeong, Minjia Zhang, Di Wang, Jonghyun Choi, and Myeongjae Jeon. Cost- effective on-device continual learning over memory hierarchy with miro. InProceedings of the 29th Annual International Conference on Mobile Computing and Networking, pages 1–15, 2023

  69. [69]

    Gpipe: Efficient training of giant neural networks using pipeline parallelism.Advances in neural information processing systems, 32, 2019

    Yanping Huang, Youlong Cheng, Ankur Bapna, Orhan Firat, Dehao Chen, Mia Chen, HyoukJoong Lee, Jiquan Ngiam, Quoc V Le, Yonghui Wu, et al. Gpipe: Efficient training of giant neural networks using pipeline parallelism.Advances in neural information processing systems, 32, 2019

  70. [70]

    Melon: Breaking the memory wall for resource-efficient on-device machine learning

    Qipeng Wang, Mengwei Xu, Chao Jin, Xinran Dong, Jinliang Yuan, Xin Jin, Gang Huang, Yunxin Liu, and Xuanzhe Liu. Melon: Breaking the memory wall for resource-efficient on-device machine learning. InProceedings of the 20th Annual International Conference on Mobile Systems, Applications and Services, pages 450–463, 2022

  71. [71]

    Training Deep Nets with Sublinear Memory Cost

    Tianqi Chen, Bing Xu, Chiyuan Zhang, and Carlos Guestrin. Training deep nets with sublinear memory cost.arXiv preprint arXiv:1604.06174, 2016

  72. [72]

    E2-train: Training state-of-the-art cnns with over 80% energy savings.Advances in Neural Information Processing Systems, 32, 2019

    Yue Wang, Ziyu Jiang, Xiaohan Chen, Pengfei Xu, Yang Zhao, Yingyan Lin, and Zhangyang Wang. E2-train: Training state-of-the-art cnns with over 80% energy savings.Advances in Neural Information Processing Systems, 32, 2019

  73. [73]

    Selective freezing for efficient continual learning

    Amelia Sorrenti, Giovanni Bellitto, Federica Proietto Salanitri, Matteo Pennisi, Concetto Spamp- inato, and Simone Palazzo. Selective freezing for efficient continual learning. InProceedings of the IEEE/CVF International Conference on Computer Vision, pages 3550–3559, 2023

  74. [74]

    Egeria: Efficient dnn training with knowledge-guided layer freezing

    Yiding Wang, Decang Sun, Kai Chen, Fan Lai, and Mosharaf Chowdhury. Egeria: Efficient dnn training with knowledge-guided layer freezing. InProceedings of the Eighteenth European Conference on Computer Systems, pages 851–866, 2023

  75. [75]

    Sequel: A continual learning library in pytorch and jax.arXiv preprint arXiv:2304.10857, 2023

    Nikolaos Dimitriadis, Francois Fleuret, and Pascal Frossard. Sequel: A continual learning library in pytorch and jax.arXiv preprint arXiv:2304.10857, 2023

  76. [76]

    Continuum: Simple management of complex continual learn- ing scenarios.arXiv preprint arXiv:2102.06253, 2021

    Arthur Douillard and Timothée Lesort. Continuum: Simple management of complex continual learn- ing scenarios.arXiv preprint arXiv:2102.06253, 2021

  77. [77]

    Sequoia: A software framework to unify continual learning research.arXiv preprint arXiv:2108.01005, 2021

    Fabrice Normandin, Florian Golemo, Oleksiy Ostapenko, Pau Rodriguez, Matthew D Riemer, Julio Hurtado, Khimya Khetarpal, Ryan Lindeborg, Lucas Cecchi, Timothée Lesort, et al. Sequoia: A software framework to unify continual learning research.arXiv preprint arXiv:2108.01005, 2021

  78. [78]

    icarl: In- cremental classifier and representation learning

    Sylvestre-Alvise Rebuffi, Alexander Kolesnikov, Georg Sperl, and Christoph H Lampert. icarl: In- cremental classifier and representation learning. InProceedings of the IEEE conference on Computer Vision and Pattern Recognition, pages 2001–2010, 2017. 24