GlossAssist -- A Tool to Simplify Corpus Creation and Study the Effect of NLP Models in Low-Resource Documentation Settings

Alexis Palmer; Bhargav Shandilya; Matt Buchholz

arxiv: 2606.04367 · v1 · pith:CBAHIVY7new · submitted 2026-06-03 · 💻 cs.CL · cs.HC

GlossAssist -- A Tool to Simplify Corpus Creation and Study the Effect of NLP Models in Low-Resource Documentation Settings

Bhargav Shandilya , Matt Buchholz , Alexis Palmer This is my paper

Pith reviewed 2026-06-28 06:51 UTC · model grok-4.3

classification 💻 cs.CL cs.HC

keywords interlinear glossed textglossing toollanguage documentationactive learningretrieval-based modelslow-resource NLPmutable lexiconfeedback loop

0 comments

The pith

GlossAssist shows that annotator corrections can expand a mutable lexicon to improve gloss predictions without retraining the model.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper presents GlossAssist, a glossing tool built on the retrieval-based architecture of CWoMP that maintains a mutable lexicon of morpheme representations. Annotator corrections are incorporated directly into this lexicon as part of an active learning process, expanding it to ground and refine future predictions. Existing automated glossing systems are designed mainly for evaluation and lack paths for linguistic expertise to shape model behavior. A sympathetic reader would care because manual creation of interlinear glossed text remains slow and expensive in language documentation, especially for low-resource languages. The paper argues that treating this correction feedback loop as a design requirement would make such NLP tools practical for field linguists.

Core claim

GlossAssist is built around the retrieval-based architecture of CWoMP, which grounds predictions in a mutable lexicon of learned morpheme representations. In conjunction with CWoMP, the system treats each correction by an annotator as part of an active learning setting, which expands the lexicon and improves future predictions without having to retrain the model. The paper argues that this feedback loop should be treated as a design requirement for NLP tools aimed at documentary linguists.

What carries the argument

The mutable lexicon of learned morpheme representations in the CWoMP retrieval-based architecture, which stores and expands from annotator corrections to ground and improve subsequent gloss predictions.

If this is right

Annotators gain an interpretable path to correct predictions and have those corrections affect future outputs immediately.
The tool becomes suitable for practical corpus creation rather than only model evaluation.
Linguistic expertise from documentary linguists can influence system behavior without requiring machine learning retraining steps.
Studies of NLP model effects in low-resource documentation workflows become possible through the integrated interface.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

If lexicon updates prove stable across multiple annotators, the method could support collaborative documentation efforts on the same language.
Analogous mutable knowledge stores might reduce retraining needs in other annotation-heavy low-resource NLP tasks such as morphological analysis.
Measuring actual annotation time savings when using the tool versus static glossers would test real-world efficiency gains.

Load-bearing premise

Corrections made by annotators can be directly incorporated into the mutable lexicon to improve future predictions without model retraining or introducing errors from inconsistent annotations.

What would settle it

A controlled test in which linguists apply repeated corrections through the tool and subsequent gloss accuracy on held-out data shows no improvement or declines would falsify the central claim.

Figures

Figures reproduced from arXiv: 2606.04367 by Alexis Palmer, Bhargav Shandilya, Matt Buchholz.

**Figure 2.** Figure 2: GlossAssist Annotation Interface 3 GlossAssist Architecture Our tool4 consists of two integrated components: (1) an annotation interface for sentence-bysentence IGT production, and (2) a researcher dashboard for analyzing model performance across annotation sessions. Together, they are designed to support three modes of use - integrating existing NLP models into active documentation workflows, evaluating… view at source ↗

**Figure 3.** Figure 3: GlossAssist Annotation Interface - User Dash [PITH_FULL_IMAGE:figures/full_fig_p006_3.png] view at source ↗

read the original abstract

Interlinear glossed text (IGT) is the standard format for linguistic annotation in language documentation. Producing it manually, however, is often slow and costly. Automated glossing systems have improved substantially in recent years, but adoption among field linguists remains limited. Existing tools are designed to be evaluated rather than used, offering no interpretable path for correction or the incorporation of linguistic expertise back into model behavior. We present GlossAssist, a glossing tool built around the retrieval-based architecture of CWoMP (Contrastive Word-Morpheme Pre-training), which grounds predictions in a mutable lexicon of learned morpheme representations. In conjunction with CWoMP, our system treats each correction by an annotator as part of an active learning setting, which expands the lexicon and improves future predictions without having to retrain the model. In this paper, we present our interface and argue that this feedback loop should be treated as a design requirement for NLP tools aimed at documentary linguists.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

GlossAssist adds a correction-to-lexicon loop on CWoMP but gives no mechanism for inconsistent annotations and no evidence it helps.

read the letter

The core of the paper is a tool called GlossAssist that sits on the CWoMP retrieval model and lets annotators correct glosses, with those corrections expanding the mutable morpheme lexicon for future predictions without retraining. The authors treat this feedback loop as a required design feature for tools aimed at documentary linguists.

The new element is the specific packaging: an active-learning interface built around the existing contrastive pre-training setup, plus the explicit argument that usability for field workers demands this kind of direct update path. The paper correctly identifies that most current automated glossers are benchmark-oriented and offer no clear route for linguistic expertise to affect model behavior going forward.

The description of the interface itself is clear enough on the intended flow. That part is useful for anyone thinking about tool design in this area.

The main gaps are straightforward. No rule is given for what happens when the same morpheme receives conflicting glosses from different annotators or sessions; the stress-test concern about missing conflict resolution holds up on the provided description. In low-resource documentation that kind of inconsistency is common, so the claim that corrections will reliably improve predictions rests on an unstated assumption. There is also no evaluation of any kind—no accuracy numbers, no user timing data, no comparison to baseline workflows—so the practical benefit remains untested.

This is a tool-design paper for people working on NLP support for language documentation. Readers who build or evaluate such systems may pick up the design point about feedback loops. It is coherent enough on its own terms to deserve a serious referee, mainly because the usability problem it names is real even if the current evidence is thin.

Referee Report

2 major / 0 minor

Summary. The paper presents GlossAssist, a glossing tool based on the CWoMP retrieval-based architecture for interlinear glossed text. It features a mutable lexicon that incorporates annotator corrections in an active learning setting to improve future predictions without retraining. The authors argue that this feedback loop should be treated as a design requirement for NLP tools aimed at documentary linguists.

Significance. If the feedback mechanism works as described, it could address key barriers to adoption of NLP tools in language documentation by enabling seamless incorporation of expert corrections. The design emphasizes usability and interpretability, which is a positive step for practical applications in low-resource settings. The paper's contribution lies in highlighting the importance of active feedback in tool design rather than in empirical results.

major comments (2)

[Abstract] Abstract: The assertion that corrections expand the lexicon and improve predictions without retraining is load-bearing for the central claim, but the description provides no mechanism for resolving inconsistent corrections when the same morpheme receives divergent glosses from annotators. This is particularly problematic in low-resource documentation settings where such inconsistencies are routine.
[Abstract] Abstract: No evaluation data, error rates, user studies, or baseline comparisons are supplied to support the claims of improved predictions or usability, which is necessary to substantiate the argument that the feedback loop constitutes a design requirement.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their constructive comments on our manuscript. We respond to each major comment below, indicating whether revisions will be incorporated.

read point-by-point responses

Referee: [Abstract] Abstract: The assertion that corrections expand the lexicon and improve predictions without retraining is load-bearing for the central claim, but the description provides no mechanism for resolving inconsistent corrections when the same morpheme receives divergent glosses from annotators. This is particularly problematic in low-resource documentation settings where such inconsistencies are routine.

Authors: We agree that explicit handling of inconsistent corrections is important for the claim. The manuscript describes the mutable lexicon as incorporating corrections via active learning to expand entries and affect retrieval, but does not detail conflict resolution for divergent glosses on the same form. We will revise to add a clear description of the mechanism (e.g., storing multiple context-sensitive entries and surfacing options to the annotator during use) to make the design fully transparent. revision: yes
Referee: [Abstract] Abstract: No evaluation data, error rates, user studies, or baseline comparisons are supplied to support the claims of improved predictions or usability, which is necessary to substantiate the argument that the feedback loop constitutes a design requirement.

Authors: The paper's primary contribution, as the referee notes, is the design argument and tool presentation rather than empirical results. The claim that the feedback loop improves predictions rests on the retrieval-based CWoMP architecture, where lexicon updates directly influence future outputs without retraining; this is presented as a conceptual design requirement for documentary tools. Quantitative evaluations are outside the scope of this work, which focuses on architecture and interface. revision: no

Circularity Check

0 steps flagged

Tool design paper presents no derivations or fitted predictions

full rationale

The manuscript describes an interface and a proposed feedback mechanism for expanding a mutable lexicon from annotator corrections. No equations, parameter fitting, uniqueness theorems, or derivation chains appear in the provided text. The central claim is a design recommendation rather than a result obtained from prior steps, so the architecture and feedback loop do not reduce to their own inputs by construction. Self-citation of CWoMP is present but does not carry a load-bearing derivation; the paper remains self-contained as a systems contribution.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

No free parameters, mathematical axioms, or invented entities are described in the abstract; the work is a systems/tool paper rather than a theoretical derivation.

pith-pipeline@v0.9.1-grok · 5708 in / 1082 out tokens · 20413 ms · 2026-06-28T06:51:04.614732+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

26 extracted references · 12 canonical work pages

[1]

Aho and Jeffrey D

Alfred V. Aho and Jeffrey D. Ullman , title =. 1972

1972
[2]

Publications Manual , year = "1983", publisher =

1983
[3]

Chandra and Dexter C

Ashok K. Chandra and Dexter C. Kozen and Larry J. Stockmeyer , year = "1981", title =. doi:10.1145/322234.322243

work page doi:10.1145/322234.322243 1981
[4]

Scalable training of

Andrew, Galen and Gao, Jianfeng , booktitle=. Scalable training of
[5]

Dan Gusfield , title =. 1997

1997
[6]

Tetreault , title =

Mohammad Sadegh Rasooli and Joel R. Tetreault , title =. Computing Research Repository , volume =. 2015 , url =

2015
[7]

Journal of Machine Learning Research , Month = dec, Numpages =

Ando, Rie Kubota and Zhang, Tong , Issn =. Journal of Machine Learning Research , Month = dec, Numpages =
[8]

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing

Rice, Enora and von der Wense, Katharina and Palmer, Alexis. Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing. 2025. doi:10.18653/v1/2025.emnlp-main.568

work page doi:10.18653/v1/2025.emnlp-main.568 2025
[9]

2603.18184 , archivePrefix=

Morris Alper and Enora Rice and Bhargav Shandilya and Alexis Palmer and Lori Levin , year=. 2603.18184 , archivePrefix=

arXiv
[10]

2601.10925 , archivePrefix=

Michael Ginn and Lindia Tjuatja and Enora Rice and Ali Marashian and Maria Valentini and Jasmine Xu and Graham Neubig and Alexis Palmer , year=. 2601.10925 , archivePrefix=

Pith/arXiv arXiv
[11]

2403.06399 , archivePrefix=

Michael Ginn and Lindia Tjuatja and Taiqi He and Enora Rice and Graham Neubig and Alexis Palmer and Lori Levin , year=. 2403.06399 , archivePrefix=

arXiv
[12]

Findings of the SIGMORPHON 2023 Shared Task on Interlinear Glossing

Ginn, Michael and Moeller, Sarah and Palmer, Alexis and Stacey, Anna and Nicolai, Garrett and Hulden, Mans and Silfverberg, Miikka. Findings of the SIGMORPHON 2023 Shared Task on Interlinear Glossing. Proceedings of the 20th SIGMORPHON workshop on Computational Research in Phonetics, Phonology, and Morphology. 2023. doi:10.18653/v1/2023.sigmorphon-1.20

work page doi:10.18653/v1/2023.sigmorphon-1.20 2023
[13]

Proceedings of the 31st International Conference on Computational Linguistics

Shandilya, Bhargav and Palmer, Alexis. Proceedings of the 31st International Conference on Computational Linguistics. 2025

2025
[14]

Evaluating Automation Strategies in Language Documentation

Palmer, Alexis and Moon, Taesun and Baldridge, Jason. Evaluating Automation Strategies in Language Documentation. Proceedings of the NAACL HLT 2009 Workshop on Active Learning for Natural Language Processing. 2009

2009
[15]

Proceedings of the Workshop on Computational Modeling of Polysynthetic Languages

Moeller, Sarah and Hulden, Mans. Proceedings of the Workshop on Computational Modeling of Polysynthetic Languages. 2018

2018
[16]

Girrbach, Leander. T. Proceedings of the 20th SIGMORPHON workshop on Computational Research in Phonetics, Phonology, and Morphology. 2023. doi:10.18653/v1/2023.sigmorphon-1.19

work page doi:10.18653/v1/2023.sigmorphon-1.19 2023
[17]

Can we teach language models to gloss endangered languages?

Ginn, Michael and Hulden, Mans and Palmer, Alexis. Can we teach language models to gloss endangered languages?. Findings of the Association for Computational Linguistics: EMNLP 2024. 2024. doi:10.18653/v1/2024.findings-emnlp.337

work page doi:10.18653/v1/2024.findings-emnlp.337 2024
[18]

Proceedings of the Fifth Workshop on the Use of Computational Methods in the Study of Endangered Languages

Gessler, Luke. Proceedings of the Fifth Workshop on the Use of Computational Methods in the Study of Endangered Languages. 2022. doi:10.18653/v1/2022.computel-1.15

work page doi:10.18653/v1/2022.computel-1.15 2022
[19]

Proceedings of the Seventh Workshop on the Use of Computational Methods in the Study of Endangered Languages

Moeller, Sarah and Arppe, Antti. Proceedings of the Seventh Workshop on the Use of Computational Methods in the Study of Endangered Languages. 2024. doi:10.18653/v1/2024.computel-1.5

work page doi:10.18653/v1/2024.computel-1.5 2024
[20]

2009 , institution=

Active Learning Literature Survey , author=. 2009 , institution=

2009
[21]

Findings of the Association for Computational Linguistics: ACL 2025

Schroeder, Hope and Roy, Deb and Kabbara, Jad. Findings of the Association for Computational Linguistics: ACL 2025. 2025. doi:10.18653/v1/2025.findings-acl.1323

work page doi:10.18653/v1/2025.findings-acl.1323 2025
[22]

Proceedings of the 28th International Conference on Computational Linguistics

Zhao, Xingyuan and Ozaki, Satoru and Anastasopoulos, Antonios and Neubig, Graham and Levin, Lori. Proceedings of the 28th International Conference on Computational Linguistics. 2020. doi:10.18653/v1/2020.coling-main.471

work page doi:10.18653/v1/2020.coling-main.471 2020
[23]

Proceedings of the 16th Workshop on Computational Research in Phonetics, Phonology, and Morphology

Cross-lingual morphological inflection with explicit alignment. Proceedings of the 16th Workshop on Computational Research in Phonetics, Phonology, and Morphology. 2019. doi:10.18653/v1/W19-4209

work page doi:10.18653/v1/w19-4209 2019
[24]

An Investigation of Noise in Morphological Inflection

Wiemerslage, Adam and Yang, Changbing and Nicolai, Garrett and Silfverberg, Miikka and Kann, Katharina. An Investigation of Noise in Morphological Inflection. Findings of the Association for Computational Linguistics: ACL 2023. 2023. doi:10.18653/v1/2023.findings-acl.207

work page doi:10.18653/v1/2023.findings-acl.207 2023
[25]

Glossy Bytes: Neural Glossing using Subword Encoding

Cross, Ziggy and Yun, Michelle and Apparaju, Ananya and MacCabe, Jata and Nicolai, Garrett and Silfverberg, Miikka. Glossy Bytes: Neural Glossing using Subword Encoding. Proceedings of the 20th SIGMORPHON workshop on Computational Research in Phonetics, Phonology, and Morphology. 2023. doi:10.18653/v1/2023.sigmorphon-1.24

work page doi:10.18653/v1/2023.sigmorphon-1.24 2023
[26]

Directions for Interlinear Morphemic Translations , volume =

Lehmann, Christian , year =. Directions for Interlinear Morphemic Translations , volume =. Folia Linguistica - FOLIA LINGUIST , doi =

[1] [1]

Aho and Jeffrey D

Alfred V. Aho and Jeffrey D. Ullman , title =. 1972

1972

[2] [2]

Publications Manual , year = "1983", publisher =

1983

[3] [3]

Chandra and Dexter C

Ashok K. Chandra and Dexter C. Kozen and Larry J. Stockmeyer , year = "1981", title =. doi:10.1145/322234.322243

work page doi:10.1145/322234.322243 1981

[4] [4]

Scalable training of

Andrew, Galen and Gao, Jianfeng , booktitle=. Scalable training of

[5] [5]

Dan Gusfield , title =. 1997

1997

[6] [6]

Tetreault , title =

Mohammad Sadegh Rasooli and Joel R. Tetreault , title =. Computing Research Repository , volume =. 2015 , url =

2015

[7] [7]

Journal of Machine Learning Research , Month = dec, Numpages =

Ando, Rie Kubota and Zhang, Tong , Issn =. Journal of Machine Learning Research , Month = dec, Numpages =

[8] [8]

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing

Rice, Enora and von der Wense, Katharina and Palmer, Alexis. Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing. 2025. doi:10.18653/v1/2025.emnlp-main.568

work page doi:10.18653/v1/2025.emnlp-main.568 2025

[9] [9]

2603.18184 , archivePrefix=

Morris Alper and Enora Rice and Bhargav Shandilya and Alexis Palmer and Lori Levin , year=. 2603.18184 , archivePrefix=

arXiv

[10] [10]

2601.10925 , archivePrefix=

Michael Ginn and Lindia Tjuatja and Enora Rice and Ali Marashian and Maria Valentini and Jasmine Xu and Graham Neubig and Alexis Palmer , year=. 2601.10925 , archivePrefix=

Pith/arXiv arXiv

[11] [11]

2403.06399 , archivePrefix=

Michael Ginn and Lindia Tjuatja and Taiqi He and Enora Rice and Graham Neubig and Alexis Palmer and Lori Levin , year=. 2403.06399 , archivePrefix=

arXiv

[12] [12]

Findings of the SIGMORPHON 2023 Shared Task on Interlinear Glossing

Ginn, Michael and Moeller, Sarah and Palmer, Alexis and Stacey, Anna and Nicolai, Garrett and Hulden, Mans and Silfverberg, Miikka. Findings of the SIGMORPHON 2023 Shared Task on Interlinear Glossing. Proceedings of the 20th SIGMORPHON workshop on Computational Research in Phonetics, Phonology, and Morphology. 2023. doi:10.18653/v1/2023.sigmorphon-1.20

work page doi:10.18653/v1/2023.sigmorphon-1.20 2023

[13] [13]

Proceedings of the 31st International Conference on Computational Linguistics

Shandilya, Bhargav and Palmer, Alexis. Proceedings of the 31st International Conference on Computational Linguistics. 2025

2025

[14] [14]

Evaluating Automation Strategies in Language Documentation

Palmer, Alexis and Moon, Taesun and Baldridge, Jason. Evaluating Automation Strategies in Language Documentation. Proceedings of the NAACL HLT 2009 Workshop on Active Learning for Natural Language Processing. 2009

2009

[15] [15]

Proceedings of the Workshop on Computational Modeling of Polysynthetic Languages

Moeller, Sarah and Hulden, Mans. Proceedings of the Workshop on Computational Modeling of Polysynthetic Languages. 2018

2018

[16] [16]

Girrbach, Leander. T. Proceedings of the 20th SIGMORPHON workshop on Computational Research in Phonetics, Phonology, and Morphology. 2023. doi:10.18653/v1/2023.sigmorphon-1.19

work page doi:10.18653/v1/2023.sigmorphon-1.19 2023

[17] [17]

Can we teach language models to gloss endangered languages?

Ginn, Michael and Hulden, Mans and Palmer, Alexis. Can we teach language models to gloss endangered languages?. Findings of the Association for Computational Linguistics: EMNLP 2024. 2024. doi:10.18653/v1/2024.findings-emnlp.337

work page doi:10.18653/v1/2024.findings-emnlp.337 2024

[18] [18]

Proceedings of the Fifth Workshop on the Use of Computational Methods in the Study of Endangered Languages

Gessler, Luke. Proceedings of the Fifth Workshop on the Use of Computational Methods in the Study of Endangered Languages. 2022. doi:10.18653/v1/2022.computel-1.15

work page doi:10.18653/v1/2022.computel-1.15 2022

[19] [19]

Proceedings of the Seventh Workshop on the Use of Computational Methods in the Study of Endangered Languages

Moeller, Sarah and Arppe, Antti. Proceedings of the Seventh Workshop on the Use of Computational Methods in the Study of Endangered Languages. 2024. doi:10.18653/v1/2024.computel-1.5

work page doi:10.18653/v1/2024.computel-1.5 2024

[20] [20]

2009 , institution=

Active Learning Literature Survey , author=. 2009 , institution=

2009

[21] [21]

Findings of the Association for Computational Linguistics: ACL 2025

Schroeder, Hope and Roy, Deb and Kabbara, Jad. Findings of the Association for Computational Linguistics: ACL 2025. 2025. doi:10.18653/v1/2025.findings-acl.1323

work page doi:10.18653/v1/2025.findings-acl.1323 2025

[22] [22]

Proceedings of the 28th International Conference on Computational Linguistics

Zhao, Xingyuan and Ozaki, Satoru and Anastasopoulos, Antonios and Neubig, Graham and Levin, Lori. Proceedings of the 28th International Conference on Computational Linguistics. 2020. doi:10.18653/v1/2020.coling-main.471

work page doi:10.18653/v1/2020.coling-main.471 2020

[23] [23]

Proceedings of the 16th Workshop on Computational Research in Phonetics, Phonology, and Morphology

Cross-lingual morphological inflection with explicit alignment. Proceedings of the 16th Workshop on Computational Research in Phonetics, Phonology, and Morphology. 2019. doi:10.18653/v1/W19-4209

work page doi:10.18653/v1/w19-4209 2019

[24] [24]

An Investigation of Noise in Morphological Inflection

Wiemerslage, Adam and Yang, Changbing and Nicolai, Garrett and Silfverberg, Miikka and Kann, Katharina. An Investigation of Noise in Morphological Inflection. Findings of the Association for Computational Linguistics: ACL 2023. 2023. doi:10.18653/v1/2023.findings-acl.207

work page doi:10.18653/v1/2023.findings-acl.207 2023

[25] [25]

Glossy Bytes: Neural Glossing using Subword Encoding

Cross, Ziggy and Yun, Michelle and Apparaju, Ananya and MacCabe, Jata and Nicolai, Garrett and Silfverberg, Miikka. Glossy Bytes: Neural Glossing using Subword Encoding. Proceedings of the 20th SIGMORPHON workshop on Computational Research in Phonetics, Phonology, and Morphology. 2023. doi:10.18653/v1/2023.sigmorphon-1.24

work page doi:10.18653/v1/2023.sigmorphon-1.24 2023

[26] [26]

Directions for Interlinear Morphemic Translations , volume =

Lehmann, Christian , year =. Directions for Interlinear Morphemic Translations , volume =. Folia Linguistica - FOLIA LINGUIST , doi =