Temporal Posed and Spontaneous Gesture Recognition from Electromyography in the Rock-Paper-Scissors Game
Pith reviewed 2026-06-30 08:06 UTC · model grok-4.3
The pith
Forearm EMG signals detect rock-paper-scissors gestures at least 800 ms before visible movement, with posed-gesture recognition reaching 63.4 percent mean accuracy.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Two-channel forearm EMG onsets can be detected at least 800 ms before the gesture becomes visible, with peaks occurring around 342 ms prior to visible onset. Self-gesture classification yields a mean accuracy of 63.4 percent for posed gestures and 53.6 percent when a posed-trained model is applied to spontaneous gestures. Recognition from the opponent's EMG reaches a peak mean accuracy of 65 percent, occurring 2082 ms after visual onset of the gesture.
What carries the argument
Two-channel electromyography from the forearm, used to extract onset timing, peak timing, and classification features for posed versus spontaneous rock-paper-scissors gestures during dyadic play.
If this is right
- Muscle activation precedes visible movement by hundreds of milliseconds, creating a predictive window for real-time gesture systems.
- Posed-gesture training data supports moderate recognition of spontaneous gestures without retraining.
- Opponent EMG signals encode information about the observed gesture through interaction dynamics.
- The reported accuracies indicate usable performance for applications needing low-latency intent detection.
- Refinements in onset detection could further extend the lead time available before movement occurs.
Where Pith is reading between the lines
- The same EMG timing advantage might apply to other rapid hand gestures in human-computer interfaces beyond the RPS task.
- High individual variation implies that personalized calibration would be needed for reliable deployment across users.
- Combining the early EMG signal with visual or inertial data could raise overall accuracy above the single-modality figures shown.
- Assistive devices for users with motor impairments could use similar pre-movement detection to reduce perceived latency.
Load-bearing premise
Individual differences in muscle activation and the variability of spontaneous gestures still allow consistent recognition across users and conditions.
What would settle it
A replication in which EMG onsets cannot be detected more than 100 ms before visible gesture onset across new participants, or in which posed-to-spontaneous transfer accuracy falls to chance level, would falsify the reported temporal and recognition advantages.
Figures
read the original abstract
The importance of gesture recognition has been acknowledged in many domains requiring real-time recognition systems. Two requirements for these are fast recognition in multiuser contexts. Therefore, we explored the temporal characteristics of electromyography (EMG) and its accuracy in recognizing gestures in a Rock-Paper-Scissors (RPS) game. Twenty-four participants played RPS in dyads, while a two-channel EMG was recorded from the forearm. We found out that EMG onsets could be detected at least 800 ms before the gesture's visible onset, and that the EMG peaks around 342 ms before the visible onset of the gesture. Furthermore, we evaluated self-gesture recognition in both posed and spontaneous gesture conditions. The mean accuracy for posed gestures reached 63.4%. The model trained on posed gestures achieved 53.6% for spontaneous gestures, with considerable variation across individuals. We also checked whether detecting a player's gesture from the opponent's EMG was possible. The peak mean accuracy was 65%, peaking at 2082 ms after the visual onset of the gesture. This suggests that the opponent's reaction to an observed gesture contains information about the observed gesture due to the dynamics of the interactions while playing. The temporal predictive advantage of EMG signals, where muscle activation precedes observable movement, offers potential benefits for applications requiring rapid intent recognition, such as human-computer interaction and assistive technologies. Future work should focus on refining onset detection and reducing the impact of spontaneous movement variability across conditions to improve recognition performance in dynamic and real-world environments.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript explores temporal properties of two-channel forearm EMG during dyadic rock-paper-scissors play with 24 participants. It claims EMG onsets precede visible gesture onsets by at least 800 ms (peaking 342 ms earlier), reports 63.4% mean accuracy for posed-gesture self-recognition, 53.6% accuracy when a model trained on posed gestures is tested on spontaneous gestures, and 65% peak accuracy for inferring a player's gesture from the opponent's EMG (peaking 2082 ms after visual onset). The work positions these temporal and cross-condition results as enabling faster intent recognition in HCI and assistive applications.
Significance. If the reported temporal lead and transfer accuracies prove robust under proper validation, the results would strengthen the case for EMG-based anticipatory gesture systems by quantifying a predictive window before visible movement and by testing generalization from posed to spontaneous conditions. The inclusion of opponent-EMG analysis adds a novel interaction-dynamic angle not commonly addressed in single-user EMG gesture papers.
major comments (3)
- [Abstract / Results] Abstract and Results: The 53.6% posed-to-spontaneous mean accuracy and the 63.4% posed accuracy are presented without standard deviation, per-subject scores, range, or any indication of whether evaluation was subject-dependent or subject-independent. Given the explicit statement of 'considerable variation across individuals,' the absence of these statistics prevents evaluation of whether recognition remains above chance for the majority of users; this directly undermines the central claim of reliable recognition.
- [Methods] Methods: No description is given of the EMG preprocessing pipeline, feature set extracted from the two channels, classifier architecture, training procedure, or cross-validation scheme used to obtain the reported accuracies. Without these details it is impossible to assess whether the 63.4%, 53.6%, and 65% figures reflect genuine generalization or overfitting to subject-specific patterns.
- [Results] Results (onset detection): The claims that EMG onsets occur 'at least 800 ms before' visible onset and peak 'around 342 ms before' are stated without the detection algorithm, amplitude or derivative threshold, false-positive control, or any statistical comparison against a null model of random timing. These numbers are load-bearing for the paper's emphasis on temporal predictive advantage.
minor comments (2)
- [Abstract] Abstract: The sentence 'we found out that' is colloquial; replace with 'we observed that' or 'EMG onsets were detected'.
- [Results] The manuscript would benefit from a table or figure summarizing per-subject accuracies or at least reporting standard deviation alongside all mean accuracies.
Simulated Author's Rebuttal
We thank the referee for the constructive comments highlighting areas where the manuscript requires greater transparency. We agree that the current version lacks critical details on statistics, methods, and onset detection, which we will address through revisions to strengthen the paper.
read point-by-point responses
-
Referee: [Abstract / Results] Abstract and Results: The 53.6% posed-to-spontaneous mean accuracy and the 63.4% posed accuracy are presented without standard deviation, per-subject scores, range, or any indication of whether evaluation was subject-dependent or subject-independent. Given the explicit statement of 'considerable variation across individuals,' the absence of these statistics prevents evaluation of whether recognition remains above chance for the majority of users; this directly undermines the central claim of reliable recognition.
Authors: We acknowledge the omission. The reported figures are subject-averaged means, but per-subject breakdowns, standard deviations, ranges, and the evaluation scheme (subject-dependent vs. independent) were not included. In revision we will add these statistics, report the fraction of participants exceeding chance, and clarify the cross-validation protocol to demonstrate that recognition is reliable for the majority despite individual variation. revision: yes
-
Referee: [Methods] Methods: No description is given of the EMG preprocessing pipeline, feature set extracted from the two channels, classifier architecture, training procedure, or cross-validation scheme used to obtain the reported accuracies. Without these details it is impossible to assess whether the 63.4%, 53.6%, and 65% figures reflect genuine generalization or overfitting to subject-specific patterns.
Authors: We agree that the Methods section is insufficiently detailed. The submitted manuscript omitted the preprocessing steps, extracted features, classifier, training procedure, and validation scheme. We will expand the Methods section with a complete description of the pipeline, feature extraction from the two EMG channels, model architecture, training, and cross-validation approach so that generalization versus overfitting can be properly evaluated. revision: yes
-
Referee: [Results] Results (onset detection): The claims that EMG onsets occur 'at least 800 ms before' visible onset and peak 'around 342 ms before' are stated without the detection algorithm, amplitude or derivative threshold, false-positive control, or any statistical comparison against a null model of random timing. These numbers are load-bearing for the paper's emphasis on temporal predictive advantage.
Authors: We recognize that the onset-detection procedure is not described. The 800 ms and 342 ms figures derive from our analysis, but the algorithm, thresholds, false-positive controls, and any null-model comparison are absent. In revision we will add a dedicated subsection detailing the detection method, thresholds, controls, and statistical comparison to random timing to substantiate the temporal claims. revision: yes
Circularity Check
No circularity: empirical measurements and cross-condition model evaluation
full rationale
The paper reports direct experimental results: EMG onset/peak timing measured from recorded signals, and accuracies from models trained on posed gestures and evaluated on spontaneous gestures (or cross-player). No equations, derivations, fitted parameters renamed as predictions, or self-citation load-bearing steps appear. Standard train/test splits on collected data do not reduce to inputs by construction.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
Continuous body and hand gesture recognition for natural human-computer interaction,
Y . Song, D. Demirdjian, and R. Davis, “Continuous body and hand gesture recognition for natural human-computer interaction,”ACM Trans. Interact. Intell. Syst., vol. 2, no. 1, pp. 5:1–5:28, 2012. [Online]. Available: https://dl.acm.org/doi/10.1145/2133366.2133371
-
[2]
V . Chang, R. O. Eniola, L. Golightly, and Q. A. Xu, “An Exploration into Human–Computer Interaction: Hand Gesture Recognition Management in a Challenging Environment,”SN Computer Science, vol. 4, no. 5, p. 441, Jun. 2023. [Online]. Available: https://doi.org/10.1007/s42979- 023-01751-y
-
[3]
Gesticulation in individuals with at risk mental states for psychosis,
A. C. Lopes-Rocha, W. H. de Paula Ramos, F. Argolo, J. M. Gondim, N. B. Mota, J. C. Andrade, A. F. Jafet, M. W. de Medeiros, M. H. Serpa, G. Cecchi, A. Ara, W. F. Gattaz, C. M. Corcoran, and A. A. Loch, “Gesticulation in individuals with at risk mental states for psychosis,”Schizophrenia, vol. 9, no. 1, pp. 1–7, May 2023, publisher: Nature Publishing Grou...
2023
-
[4]
H. Chen, X. Liu, X. Li, H. Shi, and G. Zhao, “Analyze Spontaneous Gestures for Emotional Stress State Recognition: A Micro-gesture Dataset and Analysis with Deep Learning,” in2019 14th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2019), May 2019, pp. 1–8. [Online]. Available: https://ieeexplore.ieee.org/abstract/document/8756513
-
[5]
Y . Han, X. Fan, R. Bhosale, R. Sundaram, and J. Liaw, “Fusion-based Spatiotemporal Convolutions with Constant Temporal Snapshots for Sign Language Recognition,” in2021 16th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2021), Feb. 2021, pp. 01–08. [Online]. Available: https://ieeexplore.ieee.org/document/9666973
-
[6]
A constructive approach for developing interactive humanoid robots,
T. Kanda, H. Ishiguro, M. Imai, T. Ono, and K. Mase, “A constructive approach for developing interactive humanoid robots,” inIEEE/RSJ International Conference on Intelligent Robots and Systems, vol. 2, Sep. 2002, pp. 1265–1270 vol.2
2002
-
[7]
Tracking and recognition of a human hand in dynamic motion for Janken (rock-paper- scissors) robot,
K. Ito, T. Sueishi, Y . Yamakawa, and M. Ishikawa, “Tracking and recognition of a human hand in dynamic motion for Janken (rock-paper- scissors) robot,” in2016 IEEE International Conference on Automation Science and Engineering (CASE), Aug. 2016, pp. 891–896, iSSN: 2161- 8089
2016
-
[8]
Developing a Lightweight Rock-Paper-Scissors Framework for Human- Robot Collaborative Gaming,
H. Brock, J. Ponce Chulani, L. Merino, D. Szapiro, and R. Gomez, “Developing a Lightweight Rock-Paper-Scissors Framework for Human- Robot Collaborative Gaming,”IEEE Access, vol. 8, pp. 202 958– 202 968, 2020, conference Name: IEEE Access
2020
-
[9]
Smile action unit detection from distal wearable electromyography and computer vision,
M. Perusqu ´ıa-Hern´andez, F. Dollack, C. K. Tan, S. Namba, S. Ayabe- Kanamura, and K. Suzuki, “Smile action unit detection from distal wearable electromyography and computer vision,” in2021 16th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2021), 2021, pp. 1–8
2021
-
[10]
Body gesture and head movement analyses in dyadic parent-child interaction as indicators of relationship,
S. Alghowinem, H. Chen, C. Breazeal, and H. W. Park, “Body gesture and head movement analyses in dyadic parent-child interaction as indicators of relationship,” in2021 16th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2021), 2021, pp. 01– 05
2021
-
[11]
M. Z. Hossain, T. Gedeon, and R. Sankaranarayana, “Using Temporal Features of Observers’ Physiological Measures to Distinguish Between Genuine and Fake Smiles,”IEEE Transactions on Affective Computing, vol. 11, no. 1, pp. 163–173, Jan. 2020, conference Name: IEEE Transactions on Affective Computing. [Online]. Available: https://ieeexplore.ieee.org/abstrac...
-
[12]
Hand gesture recognition for human computer interaction,
M. Panwar and P. Singh Mehra, “Hand gesture recognition for human computer interaction,” in2011 International Conference on Image Information Processing, Jan. 2011, pp. 1–7. [Online]. Available: https://ieeexplore.ieee.org/document/6108940
-
[13]
Rock-paper-scissors prediction ex- periments using muscle activations,
G. Jang, Y . Choi, and Z. Qu, “Rock-paper-scissors prediction ex- periments using muscle activations,” in2012 IEEE/RSJ International Conference on Intelligent Robots and Systems, Oct. 2012, pp. 5133– 5134, iSSN: 2153-0866
2012
-
[14]
On-line recognition of finger motions using wrist EMG and simple-PCA,
T. Funabashi, M. Ito, S.-i. Ito, and M. Fukumi, “On-line recognition of finger motions using wrist EMG and simple-PCA,” in2015 10th Asian Control Conference (ASCC), May 2015, pp. 1–5. [Online]. Available: https://ieeexplore.ieee.org/document/7244634
-
[15]
Rock-paper-scissors with Myo Armband pose detection,
Y . Ploengpit and T. Phienthrakul, “Rock-paper-scissors with Myo Armband pose detection,” in2016 International Computer Science and Engineering Conference (ICSEC), Dec. 2016, pp. 1–5. [Online]. Available: https://ieeexplore.ieee.org/document/7859949
-
[16]
Classification of rock-paper-scissors using electromyography and multi-layer perceptron,
T. Gang, Y . Cho, and Y . Choi, “Classification of rock-paper-scissors using electromyography and multi-layer perceptron,” in2017 14th In- ternational Conference on Ubiquitous Robots and Ambient Intelligence (URAI), Jun. 2017, pp. 406–407
2017
-
[17]
The timing of facial motion in posed and spontaneous smiles,
J. F. Cohn and K. L. Schmidt, “The timing of facial motion in posed and spontaneous smiles,”International Journal of Wavelets, Multiresolution and Information Processing, vol. 02, no. 02, pp. 121–132, Jun. 2004, publisher: World Scientific Publishing Co. [Online]. Available: https://www.worldscientific.com/doi/abs/10.1142/S021969130400041X
-
[18]
Effects of action observation on corticospinal excitability: Muscle specificity, direction, and timing of the mirror response,
K. R. Naish, C. Houston-Price, A. J. Bremner, and N. P. Holmes, “Effects of action observation on corticospinal excitability: Muscle specificity, direction, and timing of the mirror response,” Neuropsychologia, vol. 64, pp. 331–348, Nov. 2014. [Online]. Available: https://www.sciencedirect.com/science/article/pii/S002839321400339X
2014
-
[19]
Muscle Activation Onset Latencies and Amplitudes during Lane Change in a Full Vehicle Test,
P. Huber, M. Christova, G. A. D’Addetta, E. Gallasch, S. Kirschbichler, C. Mayer, A. Pr ¨uggler, A. Rieser, W. Sinz, and D. Wallner, “Muscle Activation Onset Latencies and Amplitudes during Lane Change in a Full Vehicle Test,”2013 IRCOBI Conference Proceedings - International Research Council on the Biomechanics of Injury, 2013
2013
-
[20]
The invisible potential of facial electromyography: A comparison of emg and computer vision when distinguishing posed from sponta- neous smiles,
M. Perusqu ´ıa-Hern´andez, S. Ayabe-Kanamura, K. Suzuki, and S. Ku- mano, “The invisible potential of facial electromyography: A comparison of emg and computer vision when distinguishing posed from sponta- neous smiles,” inProceedings of the 2019 CHI Conference on Human Factors in Computing Systems. Association for Computing Machinery, 2019
2019
-
[21]
AutoGluon-Tabular: Robust and Accurate AutoML for Structured Data
N. Erickson, J. Mueller, A. Shirkov, H. Zhang, P. Larroy, M. Li, and A. Smola, “AutoGluon-Tabular: Robust and Accurate AutoML for Structured Data,” Mar. 2020, arXiv:2003.06505 [stat]. [Online]. Available: http://arxiv.org/abs/2003.06505
work page internal anchor Pith review Pith/arXiv arXiv 2020
-
[22]
Emotion recognition in spontaneous and acted dialogues,
L. Tian, J. D. Moore, and C. Lai, “Emotion recognition in spontaneous and acted dialogues,” in2015 International Conference on Affective Computing and Intelligent Interaction (ACII), Sep. 2015, pp. 698–704, iSSN: 2156-8111
2015
-
[23]
Human perception and biosignal-based identification of posed and spontaneous smiles,
M. Perusqu ´ıa-Hern´andez, S. Ayabe-Kanamura, and K. Suzuki, “Human perception and biosignal-based identification of posed and spontaneous smiles,”PLOS ONE, vol. 14, no. 12, p. e0226328, Dec. 2019, publisher: Public Library of Science
2019
-
[24]
RealSmileNet: A Deep End-to-End Network for Spontaneous and Posed Smile Recogni- tion,
Y . Yang, M. Z. Hossain, T. Gedeon, and S. Rahman, “RealSmileNet: A Deep End-to-End Network for Spontaneous and Posed Smile Recogni- tion,” inComputer Vision – ACCV 2020: 15th Asian Conference on Computer Vision, Kyoto, Japan, November 30 – December 4, 2020, Revised Selected Papers, Part V. Berlin, Heidelberg: Springer-Verlag, 2020, pp. 21–37
2020
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.