pith. sign in

arxiv: 2606.17616 · v1 · pith:KMI6FBDKnew · submitted 2026-06-16 · 💻 cs.HC

Towards Speech Impairment Prediction in German-Speaking Individuals with Amyotrophic Lateral Sclerosis

Pith reviewed 2026-06-26 23:02 UTC · model grok-4.3

classification 💻 cs.HC
keywords amyotrophic lateral sclerosisspeech impairment predictiondysarthriarepetition tasksconcordance correlation coefficientcross-sectional modelingpersonalized modelingGerman cohort
0
0 comments X

The pith

Repetition tasks like 'da-da' predict speech-related quality of life scores in German ALS patients at CCC 0.62 across speakers.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper tests whether acoustic features from everyday speech tasks can automatically predict two clinical scores of speech impairment in people with ALS. Data from 66 German-speaking patients are used to compare prediction performance across different tasks and two modeling approaches: one that works across different speakers and one that tracks changes inside the same speaker. Repetition tasks yield the strongest cross-speaker results. The work explores how such predictions might help standardize speech data collection for this patient group. If the patterns hold, automated analysis could serve as a consistent addition to traditional assessments.

Core claim

Experiments on a German-speaking cohort of 66 pwALS show that repetition tasks (/da/-/da/, /da/-/ba/) achieved the best cross-sectional performance (CCC = 0.62) for predicting the Quality of Life in the Dysarthric Speaker questionnaire, while the within-speaker setting reached a CCC of 0.86. This study represents an initial step towards speech impairment prediction in German-speaking pwALS and highlights the potential of automated speech analysis as a supportive tool for speech impairment assessment.

What carries the argument

Cross-sectional versus within-speaker modeling paradigms applied to acoustic features from speech tasks to predict clinical questionnaire scores.

If this is right

  • Repetition tasks outperform other common speech tasks for cross-sectional prediction of the target scores.
  • Within-speaker models achieve markedly higher agreement than cross-sectional models on the same data.
  • The identified tasks support efforts to standardize speech data collection protocols for ALS patients.
  • Automated analysis of these tasks can function as a supplementary method for tracking speech impairment.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same repetition tasks could be tested in longitudinal studies to monitor progression within individual patients over months.
  • Comparable syllable repetitions might produce similar prediction accuracy when applied to ALS cohorts in other languages.
  • Mobile recording of these short tasks could enable remote, frequent checks without requiring clinic visits.

Load-bearing premise

The 66 German-speaking ALS patients and the selected speech tasks are representative enough for the reported prediction performance to apply more broadly.

What would settle it

Repeating the experiment on a new cohort of ALS patients outside Germany or with different speech tasks and obtaining substantially lower CCC values would show the results do not generalize.

read the original abstract

Amyotrophic Lateral Sclerosis (ALS) is a neurodegenerative disease, often affecting speech due to bulbar dysfunction. In this study, we predict speech impairment in people with ALS (pwALS) using two clinical speech-related scores. We evaluate cross-sectional (across speakers) and personalised (within-speaker) modelling paradigms and analyse the utility of common speech tasks to contribute to the standardisation of speech data collection for pwALS. Experiments on a German-speaking cohort of 66 pwALS show that repetition tasks (/da/-/da/, /da/-/ba/) achieved the best cross-sectional performance (Concordance Correlation Coefficient (CCC) = 0.62) for predicting the Quality of Life in the Dysarthric Speaker questionnaire, while the within-speaker setting reached a CCC of 0.86. This study represents an initial step towards speech impairment prediction in German-speaking pwALS and highlights the potential of automated speech analysis as a supportive tool for speech impairment assessment.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 1 minor

Summary. The paper claims that in a cohort of 66 German-speaking pwALS, repetition tasks (/da/-/da/, /da/-/ba/) yield the best cross-sectional prediction performance (CCC=0.62) for the Quality of Life in the Dysarthric Speaker questionnaire, while a within-speaker modelling paradigm reaches CCC=0.86; the work positions itself as an initial step toward standardizing speech data collection and automated impairment assessment.

Significance. If the performance numbers prove robust under proper validation, the study could contribute to clinical tools for ALS speech monitoring in German speakers by identifying useful speech tasks. No machine-checked proofs, reproducible code, or parameter-free derivations are described.

major comments (2)
  1. [Abstract] Abstract: the within-speaker CCC=0.86 is load-bearing for the personalised-modelling claim, yet the text supplies neither the average number of recordings per speaker nor the speaker-specific train/test split procedure, leaving open the possibility that the metric reflects overfitting rather than stable impairment signal.
  2. [Abstract] Abstract: the reported CCC values (0.62 cross-sectional, 0.86 within-speaker) cannot be assessed for soundness because no information is given on model architecture, cross-validation strategy, feature extraction pipeline, or handling of missing data.
minor comments (1)
  1. [Abstract] The abstract mentions 'two clinical speech-related scores' but only names one questionnaire; clarifying the second score would improve readability.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their constructive comments. We address the two major comments on the abstract below.

read point-by-point responses
  1. Referee: [Abstract] Abstract: the within-speaker CCC=0.86 is load-bearing for the personalised-modelling claim, yet the text supplies neither the average number of recordings per speaker nor the speaker-specific train/test split procedure, leaving open the possibility that the metric reflects overfitting rather than stable impairment signal.

    Authors: We agree that the abstract omits these details, which are needed to evaluate the within-speaker results. The manuscript describes the data collection and the leave-one-session-out procedure used for within-speaker modeling, but the abstract does not. We will revise the abstract to state the average number of recordings per speaker and to clarify the speaker-specific split procedure. revision: yes

  2. Referee: [Abstract] Abstract: the reported CCC values (0.62 cross-sectional, 0.86 within-speaker) cannot be assessed for soundness because no information is given on model architecture, cross-validation strategy, feature extraction pipeline, or handling of missing data.

    Authors: We agree that the abstract lacks this information. We will revise the abstract to include a concise summary of the model architecture, cross-validation strategy, feature extraction pipeline, and missing-data handling, while retaining the full details in the methods section. revision: yes

Circularity Check

0 steps flagged

No circularity: empirical performance metrics computed from held-out evaluation

full rationale

The paper reports Concordance Correlation Coefficient (CCC) values obtained by training models on speech features extracted from repetition tasks and evaluating them on held-out data in both cross-sectional (across-speaker) and within-speaker paradigms. No equations, derivations, or parameter-fitting procedures are described that would make the reported predictions equivalent to the inputs by construction. No self-citations are invoked to justify uniqueness or load-bearing assumptions. The central results are direct empirical measurements on a 66-patient cohort, independent of any definitional loop.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only review supplies no model equations, hyperparameters, or background assumptions; ledger therefore contains no entries.

pith-pipeline@v0.9.1-grok · 5731 in / 949 out tokens · 32051 ms · 2026-06-26T23:02:02.559715+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

47 extracted references · 16 canonical work pages

  1. [1]

    Median survival of people with ALS (pwALS) is 2-4 years, with respiratory failure as a major cause of mortalit y [2]

    Introduction Amyotrophic Lateral Sclerosis (ALS) is a severe progressiv e Motor Neuron Disease (MND), with an upper and lower mo- tor neuron pathology leading to reduced mobility, loss of mo - tor control, respiratory failure, and bulbar dysfunction p rob- lems [1]. Median survival of people with ALS (pwALS) is 2-4 years, with respiratory failure as a maj...

  2. [2]

    North Wind and the Sun

    Material and methods 2.1. Dataset The AIMnd 2.0 dataset is an extension of the dataset intro- duced in [19], collected at the outpatient clinic for MNDs at the Department of Neurology of the Technical University of Mu- nich University Hospital in Germany. The data collection re - ceived ethical approval (nr.2023-325-S-NP). Recording de tails are available...

  3. [3]

    This fea- ture set is widely used in similar studies and has shown stron g interpretability capacity [14, 25]

    The extended Geneva minimalistic acoustic parameter set (eGeMAPS) [23] from the openSMILE [24] toolkit. This fea- ture set is widely used in similar studies and has shown stron g interpretability capacity [14, 25]. Concretely, we use the 88 functionals and summary statistics from the 25 Low-Level De - scriptors (LLDs). These features are also easily reproduci ble

  4. [4]

    Embeddings from the transformer-based Whisper Large v3 model2 [26]. We use the 1280-dimensional encoder represen- tations and, although designed for Automatic Speech Recogn i- tion (ASR), Whisper’s internal features capture phonologi cal, linguistic, and prosodic cues across many languages, inclu ding German [27]. These properties make the encoder embeddi...

  5. [5]

    Table 2 reports cross- sectional performance, whereas Table 3 presents within-speaker CCC results

    Results Tables 2 and 3 summarise the CCC results for predicting the scores ALSFRS-R-speech and QOL-Dys. Table 2 reports cross- sectional performance, whereas Table 3 presents within-speaker CCC results. For each speech task, only the best-performing model and feature set are shown. Full results and additional metrics are available in the GitHub repository...

  6. [6]

    Across tasks and modelling settings, Whis- per embeddings combined with SVM outperform other feature and model combinations in most cases

    Discussion Regarding RQ1 ( prediction of speech impairment scores for pwALS), results on the testing set are promising for most speech tasks and for both settings, with exceptions found mostly for the speech task /a:/. Across tasks and modelling settings, Whis- per embeddings combined with SVM outperform other feature and model combinations in most cases....

  7. [7]

    Conclusion In this work, we explored speech impairment prediction in pwALS. We addressed three research questions: (i) whether acoustic features can predict speech-related ALS clinical scores both across speakers and within individuals over time; (ii) differences in performance between ALSFRS-R-speech and QOL-Dys scores; and (iii) how informative commonly...

  8. [8]

    To ensure reproducibility, the underlying code building blocks for this study are availabl e in https://github.com/monicagoma98/IS_AIMnd_2026

    Reproducibility Unfortunately, due to the ethics approval, the dataset usedin this work is not available to the public. To ensure reproducibility, the underlying code building blocks for this study are availabl e in https://github.com/monicagoma98/IS_AIMnd_2026

  9. [9]

    We also thank Pascal Hecker and Alexan- der Gebhard for their valuable feedback while preparing thi s manuscript

    Acknowledgments We thank Adria Mallol-Ragolta for his contributions at the s tart of the AIMnd project as well as Maxim Korman for his critical intellectual input. We also thank Pascal Hecker and Alexan- der Gebhard for their valuable feedback while preparing thi s manuscript. Last but not least, we thank all participants wh o took part in this study, wit...

  10. [10]

    It also assi sted code implementations for analyses relevant to this paper

    Generative AI Use Disclosure This work used Generative AI as a writing assistance tool, specifically for grammar check, improving readability, as w ell as improving the format of the tables presented. It also assi sted code implementations for analyses relevant to this paper

  11. [11]

    Motor neu- ron disease: Pathophysiology, diagnosis, and man- agement,

    L. A. Foster and M. K. Salajegheh, “Motor neu- ron disease: Pathophysiology, diagnosis, and man- agement,” The American Journal of Medicine , vol. 132, no. 1, pp. 32–37, Jan. 2019. [Online]. Available: https://doi.org/10.1016/j.amjmed.2018.07.012

  12. [12]

    Amyotrophic lateral sclerosi s,

    E. L. Feldman, S. A. Goutman, S. Petri, L. Mazzini, M. G. Sa veli- eff, P . J. Shaw, and G. Sobue, “Amyotrophic lateral sclerosi s,” Lancet, vol. 400, no. 10360, pp. 1363–1380, oct 2022

  13. [13]

    Mimics and chameleons in motor neurone disease,

    M. R. Turner and K. Talbot, “Mimics and chameleons in motor neurone disease,” Practical Neurology , vol. 13, no. 3, pp. 153–164, Jun. 2013. [Online]. Available: https://doi.org/10.1136/practneurol-2013-000557

  14. [14]

    Altered metabolism in mo- tor neuron diseases: Mechanism and potential thera- peutic target,

    C. Barone and X. Qi, “Altered metabolism in mo- tor neuron diseases: Mechanism and potential thera- peutic target,” Cells, vol. 12, no. 11, p. 1536, Jun. 2023, published: 2 June 2023. [Online]. Available: https://doi.org/10.3390/cells12111536

  15. [15]

    Assessment of disease progres- sion in motor neuron disease,

    J. M. C. Winhammar, D. B. Rowe, R. D. Henderson, and M. C. Kiernan, “Assessment of disease progres- sion in motor neuron disease,” The Lancet Neurology , vol. 4, no. 4, pp. 229–238, Apr. 2005. [Online]. Available: https://doi.org/10.1016/S1474-4422(05)70042-9

  16. [16]

    The alsfrs-r: a re- vised als functional rating scale that incorporates assess - ments of respiratory function,

    J. M. Cedarbaum, N. Stambler, E. Malta, C. Fuller, D. Hilt , B. Thurmond, and A. Nakanishi, “The alsfrs-r: a re- vised als functional rating scale that incorporates assess - ments of respiratory function,” Journal of the Neurolog- ical Sciences , vol. 169, no. 1-2, pp. 13–21, Oct. 1999, bDNF ALS Study Group (Phase III). [Online]. Available: https://doi.or...

  17. [17]

    K. L. Stipancic, Y . Y unusova, J. D. Berry, and J. R. Green, “Minimally detectable change and minimal clinically im- portant difference of a decline in sentence intelligibilit y and speaking rate for individuals with amyotrophic lateral scl e- rosis,” Journal of Speech, Language, and Hearing Research , vol. 61, no. 11, pp. 2757–2771, Nov. 2018. [Online]....

  18. [18]

    R eliabil- ity and validity of an instrument to measure quality of life i n the dysarthric speaker,

    V . Piacentini, A. Zuin, D. Cattaneo, and A. Schindler, “R eliabil- ity and validity of an instrument to measure quality of life i n the dysarthric speaker,” F olia Phoniatrica et Logopaedica , vol. 63, no. 6, pp. 289–295, Nov. 2011, epub 2011 Apr 6. [Online]. Avai l- able: https://doi.org/10.1159/000322800

  19. [19]

    Mea suring quality of life in the speaker with dysarthria: Reliability and va- lidity of the european portuguese version of the qol-dys,

    D. Nogueira, E. Reis, P . Ferreira, and A. Schindler, “Mea suring quality of life in the speaker with dysarthria: Reliability and va- lidity of the european portuguese version of the qol-dys,” F olia Phoniatrica et Logopaedica, vol. 71, pp. 1–15, 04 2019

  20. [20]

    A sys- tematic review and narrative analysis of digital speech biomarkers in motor neuron disease,

    M. Bowden, E. Beswick, J. Tam, D. Perry, A. Smith, J. Newton, S. Chandran, O. Watts, and S. Pal, “A sys- tematic review and narrative analysis of digital speech biomarkers in motor neuron disease,” npj Digital Medicine , vol. 6, no. 1, p. 228, Dec. 2023. [Online]. Available: https://doi.org/10.1038/s41746-023-00959-9

  21. [21]

    V oice analysis for neurological disorder recogniti on–a sys- tematic review and perspective on emerging trends,

    P . Hecker, N. Steckhan, F. Eyben, B. W. Schuller, and B. A rn- rich, “V oice analysis for neurological disorder recogniti on–a sys- tematic review and perspective on emerging trends,” Front. Digit. Health, vol. 4, p. 842301, 2022

  22. [22]

    Detect ing bulbar involvement in patients with amyotrophic lateral sclerosi s based on phonatory and time-frequency features,

    A. Ena, F. Clari` a, F. Solsona, and M. Povedano, “Detect ing bulbar involvement in patients with amyotrophic lateral sclerosi s based on phonatory and time-frequency features,” Sensors, vol. 22, no. 3, p. 1137, Jan

  23. [23]

    A machine-learning based objective mea- sure for als disease severity,

    F. G. Vieira, S. V enugopalan, A. S. Premasiri, M. Mc- Nally, A. Jansen, K. McCloskey, M. Brenner, and S. Perrin, “A machine-learning based objective mea- sure for als disease severity,” npj Digital Medicine , vol. 5, no. 1, p. 45, Mar. 2022. [Online]. Available: https://doi.org/10.1038/s41746-022-00588-8

  24. [24]

    Early detec- tion of als in absence of speech impairments with computer audition,

    A. Mallol-Ragolta, M. Gonzalez-Machorro, R. von Heyni tz, K. Scherzer, I. Cordts, and B. W. Schuller, “Early detec- tion of als in absence of speech impairments with computer audition,” in Artificial Intelligence in Medicine: 23rd Interna- tional Conference on Artificial Intelligence in Medicine, A IME 2025, Pavia, Italy, June 23–26, 2025, Proceedings, Part...

  25. [25]

    Multimo dal speech-based biomarkers outperform the als functional rat ing scale in predicting individual disease progression in als,

    H. Kothare, M. Neumann, and V . Ramanarayanan, “Multimo dal speech-based biomarkers outperform the als functional rat ing scale in predicting individual disease progression in als, ” in Proc. Interspeech 2025 . Rotterdam, Netherlands: ISCA, 2025, pp. 5313–5317

  26. [26]

    (2025) Speech analysis for neurodegenerati ve dis- eases challenge – SAND

    ICAR-CNR. (2025) Speech analysis for neurodegenerati ve dis- eases challenge – SAND. Organized in collaboration with Uni - versity of Naples Federico II and University of Campania Lui gi V anvitelli. Focus: speech biomarkers for ALS and neurodege n- erative diseases. IEEE ICASSP 2026 Grand Challenge. [Onlin e]. Available: https://www.sand.icar.cnr.it/

  27. [27]

    Com- puter audition for healthcare: A survey on speech analysis,

    K. Qian, Z. Zhao, Y . Tan, W. Zhang, M. Cho, C. Zhu, F. Tian, B. Hu, Y . Y amamoto, and B. W. Schuller, “Com- puter audition for healthcare: A survey on speech analysis, ” AI Open , vol. 6, pp. 244–275, 2025. [Online]. Available: https://www.sciencedirect.com/science/article/pii/S2666651

  28. [28]

    Challenges and practical guide- lines for atypical speech data collection, annotation, usa ge and sharing: A multi-project perspective,

    Z. Y ue, M. Barberis, T. Patel, J. Dineley, W. Doedens, L. Stip- donk, Y . Zhang, E. de Witte, E. Loweimi, H. V an hamme, D. Satoer, M. Ruiter, L. M. V elazquez, N. Cummins, and O. Scharenborg, “Challenges and practical guide- lines for atypical speech data collection, annotation, usa ge and sharing: A multi-project perspective,” in Proc. Inter- speech 20...

  29. [29]

    Detection of amyotrophic lateral sclerosis with computer audition: An impact analysis of different speech tasks,

    A. Mallol-Ragolta, M. Gonzalez-Machorro, R. von Heyni tz, K. Scherzer, I. Cordts, and B. Schuller, “Detection of amyotrophic lateral sclerosis with computer audition: An impact analysis of different speech tasks,” in 2025 47th Annual International Conference of the IEEE Engineer- ing in Medicine and Biology Society (EMBC) . Copen- hagen, Denmark: IEEE, 20...

  30. [30]

    Goodglass and E

    H. Goodglass and E. Kaplan, Boston Diagnostic Aphasia Exami- nation, 2nd ed. Philadelphia, PA: Lea & Febiger, 1983

  31. [31]

    V oice signals database of als patients with differ- ent dysarthria severity and healthy controls,

    R. Dubbioso, M. Spisto, L. V erde, V . V . Iuzzolino, G. Sen - erchia, E. Salvatore, G. De Pietro, I. De Falco, and G. San- nino, “V oice signals database of als patients with differ- ent dysarthria severity and healthy controls,” Scientific Data , vol. 11, no. 1, p. 800, Jul. 2024. [Online]. Available: https://doi.org/10.1038/s41597-024-03597-2

  32. [32]

    End-to-end speaker segmenta tion for overlap-aware resegmentation,

    H. Bredin and A. Laurent, “End-to-end speaker segmenta tion for overlap-aware resegmentation,” in Proc. Interspeech 2021, Brno, Czech Republic, August 2021

  33. [33]

    The geneva minimalistic acoustic paramet er set (gemaps) for voice research and affective computing,

    F. Eyben, K. R. Scherer, B. W. Schuller, J. Sundberg, E. A ndr´ e, C. Busso, L. Y . Devillers, J. Epps, P . Laukka, S. S. Narayanan , and K. P . Truong, “The geneva minimalistic acoustic paramet er set (gemaps) for voice research and affective computing,” IEEE Transactions on Affective Computing , vol. 7, no. 2, pp. 190–202, 2016

  34. [34]

    opensmile - the mu- nich versatile and fast open-source audio feature extracto r,

    F. Eyben, M. W¨ ollmer, and B. Schuller, “opensmile - the mu- nich versatile and fast open-source audio feature extracto r,” in Proceedings of the 18th ACM International Conference on Mul ti- media (ACM MM). Florence, Italy: ACM, Oct 2010, pp. 1459– 1462

  35. [35]

    Acous tic cor- relates of speech intelligibility: the usability of the eGeMAPS fea- ture set for atypical speech,

    W. Xue, C. Cucchiarini, R. van Hout, and H. Strik, “Acous tic cor- relates of speech intelligibility: the usability of the eGeMAPS fea- ture set for atypical speech,” in SLaTE 2019: 8th ISCA W orkshop on Speech and Language Technology in Education. ISCA: ISCA, Sep. 2019

  36. [36]

    Robust speech recognition via large-scale w eak supervision,

    A. Radford, J. W. Kim, T. Xu, G. Brockman, C. McLeavey, an d I. Sutskever, “Robust speech recognition via large-scale w eak supervision,” in Proceedings of the 40th International Conference on Machine Learning , ser. ICML’23, 2023. [Online]. Available: https://dl.acm.org/doi/10.5555/3618408.3619590

  37. [37]

    Advancing speech emotion recognition with whisper model embed- dings and hand-crafted audio descriptors,

    S. M. George and M. I. P , “Advancing speech emotion recognition with whisper model embed- dings and hand-crafted audio descriptors,” Franklin Open, vol. 13, p. 100403, 2025. [Online]. Available: https://www.sciencedirect.com/science/article/pii/S2773186325001914

  38. [38]

    Self-supervis ed learn- ing for classification of normal vs. dysarthric speech,

    H. Chaudhari, K. Kumar, and H. A. Patil, “Self-supervis ed learn- ing for classification of normal vs. dysarthric speech,” in 2025 Asia Pacific Signal and Information Processing Association An- nual Summit and Conference (APSIPA ASC) , 2025, pp. 1010– 1015

  39. [39]

    Unsupervised Cross-Lingual Representation Lear ning for Speech Recognition,

    A. Conneau, A. Baevski, R. Collobert, A. Mohamed, and M. Auli, “Unsupervised Cross-Lingual Representation Lear ning for Speech Recognition,” in Proceedings of the 22nd Annual Con- ference of the International Speech Communication Associa tion. Brno, Czechia: ISCA, 2021, pp. 2426–2430

  40. [40]

    Pre-trained models for detection and severity level classifica- tion of dysarthria from speech,

    F. Javanmardi, S. R. Kadiri, and P . Alku, “Pre-trained models for detection and severity level classifica- tion of dysarthria from speech,” Speech Communica- tion, vol. 158, p. 103047, 2024. [Online]. Available: https://www.sciencedirect.com/science/article/pii/S0167639324000190

  41. [41]

    Confidence intervals for evaluation in machine learning,

    L. Ferrer, “Confidence intervals for evaluation in machine learning,” https://github.com/luferrer/ConfidenceIntervals

  42. [42]

    Best practices for binary a nd ordinal data analyses,

    B. V erhulst and M. C. Neale, “Best practices for binary a nd ordinal data analyses,” Behavior Genetics , vol. 51, no. 3, pp. 204–214, May 2021, epub 2021 Jan 5. [Online]. Available: https://doi.org/10.1007/s10519-020-10031-x

  43. [43]

    Classification of als patients based on acoustic analysis of sustained vowel phonations,

    M. V ashkevich and Y . Rushkevich, “Classification of als patients based on acoustic analysis of sustained vowel phonations,” Biomedical Signal Processing and Con- trol, vol. 65, p. 102350, 2021. [Online]. Available: https://www.sciencedirect.com/science/article/pii/S1746809420304614

  44. [44]

    Deve l- opment and validation of a wav2vec 2.0-based cross-languag e methodology for measurement of articulatory precision,

    T. Talkar, K. Kawabata, C. Higgins, and S. Tobyne, “Deve l- opment and validation of a wav2vec 2.0-based cross-languag e methodology for measurement of articulatory precision,” i n Proc. Interspeech 2025 . ISCA, Aug. 2025, pp. 3748–3752, rotter- dam, The Netherlands, August 17–21, 2025. [Online]. Availa ble: https://doi.org/10.21437/Interspeech.2025-2162

  45. [45]

    Automated speech analytics in als: higher sensitivity of d igital articulatory precision over the alsfrs-r,

    G. Stegmann, C. Krantsevich, J. Liss, S. Charles, M. Bar tlett, J. Shefner, S. Rutkove, K. Kawabata, T. Talkar, and V . Berish a, “Automated speech analytics in als: higher sensitivity of d igital articulatory precision over the alsfrs-r,” Amyotrophic Lat- eral Sclerosis and Frontotemporal Degeneration , vol. 25, no. 7-8, pp. 767–775, Oct. 2024. [Online]....

  46. [46]

    Clarity ai: A c om- prehensive checklist integrating established frameworks for en- hanced research quality in medical ai studies,

    L. Marconi, E. Pirovano, and F. Cabitza, “Clarity ai: A c om- prehensive checklist integrating established frameworks for en- hanced research quality in medical ai studies,” in Proceedings of the 3rd AIxIA W orkshop on Artificial Intelligence F or Heal th- care (HC@AIxIA 2024) , ser. CEUR Workshop Proceedings, vol

  47. [47]

    2024, co-located with the 23rd International Conference of the Italian Assoc ia- tion for Artificial Intelligence (AIxIA 2024)

    Bolzano, Italy: CEUR-WS.org, Nov. 2024, co-located with the 23rd International Conference of the Italian Assoc ia- tion for Artificial Intelligence (AIxIA 2024). [Online]. Av ailable: https://ceur-ws.org/Vol-3880/paper1.pdf