CoPersona: Collaborative Persona Graphs for Robust LLM Personalization

Hiren Madhu; Leyao Wang; Ngoc Bui; Rex Ying; Walter Roznyatovskiy; Yangtian Zhang

arxiv: 2607.01485 · v1 · pith:AIQFLKVGnew · submitted 2026-07-01 · 💻 cs.IR

Yangtian Zhang, Leyao Wang, Hiren Madhu, Ngoc Bui, Walter Roznyatovskiy, Rex Ying

Pith reviewed 2026-07-03 18:18 UTC · model grok-4.3

classification 💻 cs.IR

keywords LLM personalization · collaborative personalization · persona graphs · multiplex graphs · sparse user histories · facet alignment · user similarity

The pith

CoPersona builds multiplex persona graphs to model facet-level user alignments and borrow signals from peers for robust LLM personalization with sparse histories.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

Sparse and skewed user interaction histories make it hard for LLMs to infer preferences on under-observed facets, leading to brittle personalization. CoPersona tackles this by decomposing histories into facet-level representations and using a multiplex persona graph to explicitly align similar users on each facet separately. This allows borrowing relevant signals from peers without the bias that arises when comparing users in a single global space. The system uses a dual-branch setup with peer retrieval and graph reasoning during inference. Tests across domains and scales show steady gains over baselines.

Core claim

CoPersona decomposes interaction histories into multiple facet-level representations and explicitly models peer-to-peer, facet-level alignment through a multiplex persona graph to complete sparse user profiles by borrowing signals from behaviorally similar peers, using a dual-branch architecture of non-parametric peer retrieval and parametric graph reasoning at inference time.
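
As a rough sketch of how two such branches could sit side by side at inference time: the abstract does not specify the mechanism, so everything below is an assumption. The non-parametric branch is modeled as a top-k lookup over stored facet-level similarities, and the parametric branch is stood in for by one step of neighbor averaging followed by a weight matrix that would be learned in training. The names `retrieve_peers`, `graph_signal`, and `personalization_inputs` are hypothetical.

```python
def retrieve_peers(user, facet, similarity, k=2):
    """Non-parametric branch: top-k peers by stored facet-level similarity.

    similarity: {(user, peer, facet): score}, assumed to be precomputed.
    """
    scored = [(score, peer) for (u, peer, f), score in similarity.items()
              if u == user and f == facet]
    scored.sort(key=lambda pair: (-pair[0], pair[1]))  # high score first, ties by name
    return [peer for _, peer in scored[:k]]


def graph_signal(user, facet, adjacency, embeddings, weight):
    """Parametric branch (stand-in): average neighbour embeddings on one
    facet layer, then apply a weight matrix that would be learned in training."""
    neighbours = sorted(adjacency.get(facet, {}).get(user, ()))
    if not neighbours:
        return None
    dim = len(embeddings[neighbours[0]])
    mean = [sum(embeddings[n][i] for n in neighbours) / len(neighbours)
            for i in range(dim)]
    return [sum(w * x for w, x in zip(row, mean)) for row in weight]


def personalization_inputs(user, facet, similarity, adjacency, embeddings, weight, k=2):
    """Bundle both branches; how the LLM consumes them is left open."""
    return {
        "retrieved_peers": retrieve_peers(user, facet, similarity, k),
        "graph_signal": graph_signal(user, facet, adjacency, embeddings, weight),
    }


# Toy inputs, invented for illustration only.
similarity = {("u1", "u2", "tone"): 0.9, ("u1", "u3", "tone"): 0.4,
              ("u1", "u4", "tone"): 0.7}
adjacency = {"tone": {"u1": {"u2", "u4"}}}
embeddings = {"u2": [1.0, 0.0], "u4": [0.0, 1.0]}
weight = [[1.0, 0.0], [0.0, 2.0]]  # placeholder values, not trained
out = personalization_inputs("u1", "tone", similarity, adjacency, embeddings, weight)
```

The retrieved peers would plausibly feed the prompt as examples while the graph signal conditions the model in some learned way, but that split is a guess from the wording "non-parametric" and "parametric", not something the abstract states.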

What carries the argument

The multiplex persona graph that decomposes user histories into facets and connects peers at the facet level to enable aligned signal transfer.
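
A minimal sketch of what such a structure might look like, under assumptions the abstract does not confirm: each user has one embedding per observed facet, each facet forms its own graph layer, and two users are linked in a layer when the cosine similarity of their embeddings on that facet passes a threshold. The borrowing rule (average the missing facet over peers linked in any other layer) is also an assumption. `build_multiplex_graph`, `borrow_facet`, and `threshold` are hypothetical names.

```python
import math
from collections import defaultdict
from itertools import combinations


def cosine(a, b):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0


def build_multiplex_graph(facet_vectors, threshold=0.8):
    """Build one undirected adjacency map per facet (one layer per facet).

    facet_vectors: {user: {facet: vector}}; a facet missing from a user's
    dict means the history gave no evidence for it (assumed encoding).
    Returns {facet: {user: set_of_peers}}.
    """
    layers = defaultdict(lambda: defaultdict(set))
    for u, v in combinations(sorted(facet_vectors), 2):
        shared = facet_vectors[u].keys() & facet_vectors[v].keys()
        for facet in shared:
            if cosine(facet_vectors[u][facet], facet_vectors[v][facet]) >= threshold:
                layers[facet][u].add(v)
                layers[facet][v].add(u)
    return layers


def borrow_facet(user, missing_facet, facet_vectors, layers):
    """Estimate a missing facet by averaging it over peers that are linked to
    the user in any other layer and that do have the missing facet."""
    peers = set()
    for facet, adjacency in layers.items():
        if facet != missing_facet:
            peers |= adjacency.get(user, set())
    donors = [facet_vectors[p][missing_facet] for p in sorted(peers)
              if missing_facet in facet_vectors[p]]
    if not donors:
        return None
    dim = len(donors[0])
    return [sum(d[i] for d in donors) / len(donors) for i in range(dim)]


# Toy data, invented for illustration.
facet_vectors = {
    "alice": {"tone": [1.0, 0.0], "topic": [0.0, 1.0]},
    "bob":   {"tone": [0.9, 0.1]},                      # no topic evidence
    "carol": {"tone": [0.0, 1.0], "topic": [1.0, 0.0]},
}
layers = build_multiplex_graph(facet_vectors)
bob_topic = borrow_facet("bob", "topic", facet_vectors, layers)
```

Here bob has no evidence on the topic facet, aligns with alice on tone, and so inherits alice's topic vector; carol is not linked to anyone and gets nothing to borrow.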

If this is right

Consistent performance improvements over strong baselines in multiple domains and model scales.
Greater robustness when test-time requests involve under-supported facets.
Effective use of peer information without direct transfer of biased global signals.
Scalable collaborative personalization that handles uneven facet coverage.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the authors make directly.

The facet-level alignment could help in other sparse data settings like recommendation systems beyond LLMs.
Interpreting the graph edges might provide insights into why certain users are similar on specific preferences.
Extending the graph to include temporal facets could capture evolving user interests.

Load-bearing premise

That decomposing histories into facet-level representations and modeling explicit peer-to-peer alignment in a multiplex graph can overcome bias from uneven facet coverage that obscures similarity in the global space.
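
The premise can be illustrated with a toy example (an illustration under assumed encodings, not the paper's experiment): two users hold the same per-facet preferences, but their logs cover the facets unevenly, so a mean-pooled global vector makes them look dissimilar while a per-facet comparison still matches them. The vectors and the helper names are invented for the illustration.

```python
import math


def cosine(a, b):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0


def mean_pool(vectors):
    """Element-wise mean of a non-empty list of equal-length vectors."""
    n = len(vectors)
    return [sum(v[i] for v in vectors) / n for i in range(len(vectors[0]))]


def facet_profile(history):
    """Group interactions by facet and mean-pool each group."""
    by_facet = {}
    for facet, vec in history:
        by_facet.setdefault(facet, []).append(vec)
    return {facet: mean_pool(vecs) for facet, vecs in by_facet.items()}


# Assumed toy embedding: each interaction is a 2-d vector tagged with a facet.
STYLE = [1.0, 0.0]   # what both users prefer on the "style" facet
TOPIC = [0.0, 1.0]   # what both users prefer on the "topic" facet

# Same preferences, opposite coverage: 9 style + 1 topic vs 1 style + 9 topic.
user_a = [("style", STYLE)] * 9 + [("topic", TOPIC)] * 1
user_b = [("style", STYLE)] * 1 + [("topic", TOPIC)] * 9

global_sim = cosine(mean_pool([v for _, v in user_a]),
                    mean_pool([v for _, v in user_b]))
profile_a, profile_b = facet_profile(user_a), facet_profile(user_b)
facet_sims = {f: cosine(profile_a[f], profile_b[f]) for f in profile_a}

print(global_sim)   # global view: the users look only weakly similar
print(facet_sims)   # facet view: the users agree on both facets
```

The example only shows that the bias exists in a pooled space. It does not show that facets extracted from the same skewed logs are themselves reliable, which is the harder part of the premise.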

What would settle it

A controlled test on datasets with deliberately skewed facet distributions where CoPersona shows no improvement or worse performance than global similarity baselines would indicate the assumption does not hold.
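
One way such a test set could be constructed, sketched under assumptions: interactions already carry a facet label, one facet per user is deliberately downsampled in the observed history, and every test request is drawn from the removed interactions of that facet. The function names and the `(facet, item)` format are hypothetical.

```python
import random


def skewed_split(history, held_out_facet, keep_fraction=0.1, seed=0):
    """Split one user's history to simulate uneven facet coverage.

    history: list of (facet, item) pairs (assumed format).
    Returns (observed, test): `observed` keeps every interaction from other
    facets but only about `keep_fraction` of the held-out facet; `test`
    holds the removed held-out interactions, so every test request targets
    the under-supported facet.
    """
    rng = random.Random(seed)
    target = [x for x in history if x[0] == held_out_facet]
    others = [x for x in history if x[0] != held_out_facet]
    rng.shuffle(target)
    n_keep = int(len(target) * keep_fraction)
    return others + target[:n_keep], target[n_keep:]


def coverage(history):
    """Fraction of interactions per facet."""
    counts = {}
    for facet, _ in history:
        counts[facet] = counts.get(facet, 0) + 1
    total = len(history)
    return {facet: c / total for facet, c in counts.items()}


# Toy history: 20 interactions on each of two facets.
history = [("style", i) for i in range(20)] + [("topic", i) for i in range(20)]
observed, test = skewed_split(history, held_out_facet="topic", keep_fraction=0.1)
```

Scoring CoPersona and a global-similarity baseline on `test` after giving both only `observed`, at several values of `keep_fraction`, would give the comparison described above.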

Figures

Figures reproduced from arXiv: 2607.01485 by Hiren Madhu, Leyao Wang, Ngoc Bui, Rex Ying, Walter Roznyatovskiy, Yangtian Zhang.

**Figure 2.** CoPersona overview. CoPersona personalizes an LLM by (1) mining representative users and inducing a global facet

**Figure 3.** Sensitivity to decoding temperature. [Plot panels: ROUGE-1, ROUGE-L, METEOR, and BLEU versus # Facets (3 to 6)]

**Figure 5.** Ablation on the number of retrieved neighbors

Original abstract

Real-world LLM personalization is often constrained by sparse and skewed user histories: most users provide only a handful of interactions, while even frequent users' logs capture an incomplete and biased view of their preferences. As a result, weakly observed user attributes are difficult to infer, leading to brittle personalization when test-time requests shift toward under-supported facets. Motivated by this limitation, we present CoPersona, a graph-based collaborative personalization framework that completes sparse user profiles by borrowing signals from behaviorally similar peers. However, directly transferring signals is difficult because uneven facet coverage introduces bias into interaction histories, obscuring user similarity in the unstructured global space. To address this issue, CoPersona decomposes interaction histories into multiple facet-level representations and explicitly models peer-to-peer, facet-level alignment through a multiplex persona graph. To effectively leverage peer information at inference time, we employ a dual-branch architecture that combines non-parametric peer retrieval with parametric graph reasoning. Experiments across multiple domains and model scales demonstrate consistent improvements over strong baselines, validating CoPersona as an effective approach for robust LLM personalization.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note

private letter to a colleague

CoPersona targets sparse user histories in LLM personalization with facet graphs and dual-branch inference, but the decomposition may simply move the coverage bias rather than remove it.

CoPersona decomposes user histories into facets, builds a multiplex persona graph for peer alignment at the facet level, and uses a dual-branch setup (non-parametric retrieval plus parametric graph reasoning) to pull in signals from similar users.

The combination of explicit facet alignment and the dual-branch architecture is the concrete new piece. It directly tackles the motivation that global similarity gets distorted by uneven facet coverage in sparse logs, which is a practical constraint in deployed systems.

The paper does a clean job stating the problem and outlining why borrowing from peers at the facet level could help without needing more data collection.

The soft spot is the one in the stress-test note. If facets are extracted from the same biased and incomplete histories, under-supported areas will still yield weak or missing representations, and the multiplex graph will have little to align on those facets. The abstract gives no independent discovery mechanism or bootstrap, so the central assumption needs the full methods section to hold up.

Experiments are described as showing consistent gains across domains and scales, but without ablations or implementation details visible here it is difficult to separate the graph contribution from other choices.

This is for researchers and engineers working on production LLM personalization or collaborative methods for recommendation. Readers who care about graph-based ways to handle data sparsity would find the framework worth examining.

It deserves peer review because the problem is real and the proposal is specific enough to test, even if the bias-handling claim will need stronger evidence.

Referee Report

2 major / 2 minor

Summary. The manuscript introduces CoPersona, a graph-based collaborative framework for LLM personalization. It decomposes sparse and skewed user interaction histories into multiple facet-level representations, constructs a multiplex persona graph to explicitly model peer-to-peer facet alignments, and employs a dual-branch inference architecture combining non-parametric peer retrieval with parametric graph reasoning. Experiments across domains and model scales report consistent gains over strong baselines.

Significance. If the central claims hold, the work addresses a practically important limitation in real-world LLM personalization—brittle performance on under-supported facets due to uneven history coverage—by leveraging collaborative signals at the facet level rather than globally. The explicit multiplex-graph modeling and dual-branch design are concrete technical contributions that could influence subsequent graph-augmented personalization systems.

major comments (2)

[Motivation and §3 (Method)] The claim that facet-level decomposition yields cleaner similarity signals than the global space rests on the assumption that facet extraction itself is not biased by the same sparse coverage. The manuscript does not describe an independent facet-discovery procedure (e.g., external corpus, pre-trained topic model, or bootstrap step) that would avoid inheriting coverage bias from the original histories; without this, the multiplex-graph edges on under-supported facets remain unreliable, directly undermining the central motivation.
[§4 (Experiments) and Table X (main results)] While consistent improvements are reported, the absence of an ablation that isolates the contribution of the multiplex alignment (versus simple global retrieval or single-facet graphs) makes it difficult to attribute gains specifically to the facet-level peer modeling. This is load-bearing because the paper’s novelty claim centers on the multiplex structure.

minor comments (2)

[§3] Notation for the multiplex graph (e.g., edge types per facet) should be formalized with a clear mathematical definition early in §3 to improve readability.
[Figure 2] The dual-branch architecture diagram would benefit from explicit labeling of which branch is non-parametric and which is parametric.
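
For the notation requested in the first minor comment, one standard way to write down a multiplex graph is given here as a sketch. The symbols are assumptions, not the paper's notation: $\mathcal{U}$ is the user set, $\mathcal{F}$ the facet set, $z_u^{(f)}$ the embedding of user $u$ on facet $f$, and $\tau$ a similarity threshold. The thresholded edge rule is only one possible choice.

```latex
% Sketch only: symbols are assumed here, not taken from the paper.
\mathcal{G} = \bigl(\mathcal{U},\ \{\mathcal{E}^{(f)}\}_{f \in \mathcal{F}}\bigr),
\qquad \mathcal{E}^{(f)} \subseteq \mathcal{U} \times \mathcal{U}

% One adjacency matrix per facet layer f:
A^{(f)}_{uv} =
\begin{cases}
1 & \text{if } (u, v) \in \mathcal{E}^{(f)},\\
0 & \text{otherwise.}
\end{cases}

% A possible edge rule, with facet embedding z_u^{(f)} and threshold \tau:
(u, v) \in \mathcal{E}^{(f)} \iff \operatorname{sim}\bigl(z_u^{(f)}, z_v^{(f)}\bigr) \ge \tau
```

Writing the layers as a family $\{A^{(f)}\}_{f \in \mathcal{F}}$ makes the contrast with a single global graph explicit: the global baseline collapses all layers into one adjacency matrix.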

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive feedback highlighting important aspects of our motivation and experimental design. We address each major comment below and commit to revisions that strengthen the manuscript.

Referee: [Motivation and §3 (Method)] The claim that facet-level decomposition yields cleaner similarity signals than the global space rests on the assumption that facet extraction itself is not biased by the same sparse coverage. The manuscript does not describe an independent facet-discovery procedure (e.g., external corpus, pre-trained topic model, or bootstrap step) that would avoid inheriting coverage bias from the original histories; without this, the multiplex-graph edges on under-supported facets remain unreliable, directly undermining the central motivation.

Authors: We agree that the reliability of facet extraction is central to the motivation and that the current manuscript does not provide sufficient detail on an independent discovery procedure. In the revised version we will expand §3 to explicitly describe the facet extraction process, incorporate a bootstrap initialization step drawing on a small external seed corpus, and add a discussion of how this mitigates inheritance of coverage bias into the multiplex edges. This revision directly addresses the concern while preserving the core collaborative-alignment contribution.

revision: yes

Referee: [§4 (Experiments) and Table X (main results)] While consistent improvements are reported, the absence of an ablation that isolates the contribution of the multiplex alignment (versus simple global retrieval or single-facet graphs) makes it difficult to attribute gains specifically to the facet-level peer modeling. This is load-bearing because the paper’s novelty claim centers on the multiplex structure.

Authors: We acknowledge that the lack of a targeted ablation for the multiplex component limits the ability to isolate its contribution. The existing experiments compare against strong baselines but do not include the requested controls. In the revision we will add an ablation study to §4 and update Table X to compare the full multiplex model against (i) global retrieval without facet decomposition and (ii) single-facet graphs, thereby providing clearer evidence for the value of the multiplex structure.

revision: yes

Circularity Check

0 steps flagged

No circularity: method described at conceptual level with no equations or self-referential fits

The provided abstract and description contain no mathematical derivations, equations, fitted parameters, or self-citations that could reduce claims to inputs by construction. The framework is presented as a high-level graph-based approach, with no visible prediction step that loops back to fitted values. Under the audit's rules, a circularity can only be flagged by quoting a load-bearing equation or self-citation chain and showing the reduction; since neither is present, none is flagged.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only view yields no identifiable free parameters, axioms, or invented entities; full text required for ledger construction.


[1] Nor Aniza Abdullah, Rasheed Abubakar Rasheed, Mohd Hairul Nizam Md Nasir, and Md Mujibur Rahman. 2021. Eliciting auxiliary information for cold start user recommendation: A survey. Applied Sciences 11, 20 (2021), 9608.

[2] Gati V Aher, Rosa I Arriaga, and Adam Tauman Kalai. 2023. Using large language models to simulate multiple humans and replicate human subject studies. In International Conference on Machine Learning. PMLR, 337–371.

[3] Steven Au, Cameron J Dimacali, Ojasmitha Pedirappagari, Namyong Park, Franck Dernoncourt, Yu Wang, Nikos Kanakaris, Hanieh Deilamsalehy, Ryan A Rossi, and Nesreen K Ahmed. 2025. Personalized graph-based retrieval for large language models. arXiv preprint arXiv:2501.02157 (2025).

[4] Satanjeev Banerjee and Alon Lavie. 2005. METEOR: An automatic metric for MT evaluation with improved correlation with human judgments. In Proceedings of the ACL workshop on intrinsic and extrinsic evaluation measures for machine translation and/or summarization. 65–72.

[5] Iz Beltagy, Matthew E Peters, and Arman Cohan. 2020. Longformer: The long-document transformer. arXiv preprint arXiv:2004.05150 (2020).

[6] Jesús Bobadilla, Fernando Ortega, Antonio Hernando, and Jesús Bernal. 2012. A collaborative filtering approach to mitigate the new user cold start problem. Knowledge-based systems 26 (2012), 225–238.

[7] Ngoc Bui, Hieu Trung Nguyen, Shantanu Kumar, Julian Theodore, Weikang Qiu, Viet Anh Nguyen, and Rex Ying. 2025. Mixture-of-personas language models for population simulation. arXiv preprint arXiv:2504.05019 (2025).

[8] Jiawei Chen, Hande Dong, Xiang Wang, Fuli Feng, Meng Wang, and Xiangnan He. Bias and Debias in Recommender System: A Survey and Future Directions. arXiv:2010.03240 [cs.IR] https://arxiv.org/abs/2010.03240

[10] Jin Chen, Zheng Liu, Xu Huang, Chenwang Wu, Qi Liu, Gangwei Jiang, Yuanhao Pu, Yuxuan Lei, Xiaolong Chen, Xingmei Wang, et al. 2024. When large language models meet personalization: Perspectives of challenges and opportunities. World Wide Web 27, 4 (2024), 42.

[11] Yi-Pei Chen, Noriki Nishida, Hideki Nakayama, and Yuji Matsumoto. 2024. Recent Trends in Personalized Dialogue Generation: A Review of Datasets, Methodologies, and Evaluations. arXiv:2405.17974 [cs.CL] https://arxiv.org/abs/2405.17974

[12] J. A. Hartigan and M. A. Wong. 1979. Algorithm AS 136: A K-Means Clustering Algorithm. Applied Statistics 28, 1 (1979), 100–108. doi:10.2307/2346830

[13] Liam Hebert, Krishna Sayana, Ambarish Jash, Alexandros Karatzoglou, Sukhdeep Sodhi, Sumanth Doddapaneni, Yanli Cai, and Dima Kuzmin. 2024. Persoma: Personalized soft prompt adapter architecture for personalized language prompting. arXiv preprint arXiv:2408.00960 (2024).

[14] Yupeng Hou, Jiacheng Li, Zhankui He, An Yan, Xiusi Chen, and Julian McAuley. 2024. Bridging language and items for retrieval and recommendation. arXiv preprint arXiv:2403.03952 (2024).

[16] Edward J Hu, Yelong Shen, Phillip Wallis, Zeyuan Allen-Zhu, Yuanzhi Li, Shean Wang, Lu Wang, Weizhu Chen, et al. 2022. Lora: Low-rank adaptation of large language models. ICLR 1, 2 (2022), 3.

[17] Qiushi Huang, Shuai Fu, Xubo Liu, Wenwu Wang, Tom Ko, Yu Zhang, and Lilian Tang. 2023. Learning retrieval augmentation for personalized dialogue generation. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing. 2523–2540.

[18] EunJeong Hwang, Bodhisattwa Majumder, and Niket Tandon. 2023. Aligning language models to user opinions. In Findings of the Association for Computational Linguistics: EMNLP 2023. 5906–5919.

[19] Gautier Izacard, Mathilde Caron, Lucas Hosseini, Sebastian Riedel, Piotr Bojanowski, Armand Joulin, and Edouard Grave. 2021. Unsupervised dense information retrieval with contrastive learning. arXiv preprint arXiv:2112.09118 (2021).

[20] Joel Jang, Seungone Kim, Bill Yuchen Lin, Yizhong Wang, Jack Hessel, Luke Zettlemoyer, Hannaneh Hajishirzi, Yejin Choi, and Prithviraj Ammanabrolu. 2023. Personalized soups: Personalized large language model alignment via post-hoc parameter merging. arXiv preprint arXiv:2310.11564 (2023).

[22] Daniel Jurafsky and James H. Martin. 2026. Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition, with Language Models (3rd ed.). Online manuscript released January 6, 2026. https://web.stanford.edu/~jurafsky/slp3/

[23] Wang-Cheng Kang, Jianmo Ni, Nikhil Mehta, Maheswaran Sathiamoorthy, Lichan Hong, Ed Chi, and Derek Zhiyuan Cheng. 2023. Do llms understand user preferences? evaluating llms on user rating prediction. arXiv preprint arXiv:2305.06474 (2023).

[24] Ishita Kumar, Snigdha Viswanathan, Sushrita Yerra, Alireza Salemi, Ryan A Rossi, Franck Dernoncourt, Hanieh Deilamsalehy, Xiang Chen, Ruiyi Zhang, Shubham Agarwal, et al. 2024. Longlamp: A benchmark for personalized long-form text generation. arXiv preprint arXiv:2407.11016 (2024).

[25] Woosuk Kwon, Zhuohan Li, Siyuan Zhuang, Ying Sheng, Lianmin Zheng, Cody Hao Yu, Joseph Gonzalez, Hao Zhang, and Ion Stoica. 2023. Efficient memory management for large language model serving with pagedattention. In Proceedings of the 29th Symposium on Operating Systems Principles. 611–626.

[26] Cheng Li, Mingyang Zhang, Qiaozhu Mei, Weize Kong, and Michael Bendersky. 2024. Learning to rewrite prompts for personalized text generation. In Proceedings of the ACM Web Conference 2024. 3367–3378.

[28] Cheng Li, Mingyang Zhang, Qiaozhu Mei, Yaqing Wang, Spurthi Amba Hombaiah, Yi Liang, and Michael Bendersky. 2023. Teach LLMs to Personalize–An Approach inspired by Writing Education. arXiv preprint arXiv:2308.07968 (2023).

[29] Jinming Li, Wentao Zhang, Tian Wang, Guanglei Xiong, Alan Lu, and Gerard Medioni. 2023. GPT4Rec: A generative framework for personalized recommendation and user interests interpretation. arXiv preprint arXiv:2304.03879 (2023).

[30] Xinyu Li, Ruiyang Zhou, Zachary C Lipton, and Liu Leqi. 2024. Personalized language modeling from personalized human feedback. arXiv preprint arXiv:2402.05133 (2024).

[31] Blerina Lika, Kostas Kolomvatsos, and Stathes Hadjiefthymiades. 2014. Facing the cold start problem in recommender systems. Expert systems with applications 41, 4 (2014), 2065–2073.

[32] Chin-Yew Lin. 2004. Rouge: A package for automatic evaluation of summaries. In Text summarization branches out. 74–81.

[33] Jiongnan Liu, Yutao Zhu, Shuting Wang, Xiaochi Wei, Erxue Min, Yu Lu, Shuaiqiang Wang, Dawei Yin, and Zhicheng Dou. 2025. Llms+ persona-plug= personalized llms. In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 9373–9385.

[34] Ilya Loshchilov and Frank Hutter. 2017. Decoupled weight decay regularization. arXiv preprint arXiv:1711.05101 (2017).

[35] Zhenyi Lu, Wei Wei, Xiaoye Qu, Xian-Ling Mao, Dangyang Chen, and Jixiong Chen. 2023. Miracle: Towards Personalized Dialogue Generation with Latent-Space Multiple Personal Attribute Control. In Findings of the Association for Computational Linguistics: EMNLP 2023, Houda Bouamor, Juan ... doi:10.18653/v1/2023.findings-emnlp.395

KDD ’26, August 09–13, 2026, Jeju Island, Republic of Korea. Zhang et al.

[36] Hanjia Lyu, Song Jiang, Hanqing Zeng, Yinglong Xia, Qifan Wang, Si Zhang, Ren Chen, Chris Leung, Jiajie Tang, and Jiebo Luo. 2024. Llm-rec: Personalized recommendation via prompting large language models. In Findings of the Association for Computational Linguistics: NAACL 2024. 583–612.

[37] Wenyu Mao, Jiancan Wu, Weijian Chen, Chongming Gao, Xiang Wang, and Xiangnan He. 2025. Reinforced prompt personalization for recommendation with large language models. ACM Transactions on Information Systems 43, 3 (2025), 1–27.

[38] Sheshera Mysore, Zhuoran Lu, Mengting Wan, Longqi Yang, Bahareh Sarrafzadeh, Steve Menezes, Tina Baghaee, Emmanuel Barajas Gonzalez, Jennifer Neville, and Tara Safavi. 2024. Pearl: Personalizing large language model writing assistants with generation-calibrated retrievers. In Proceedings of the 1st Workshop on Customizable NLP: Progress and Challenges in...

[39] Jianmo Ni, Jiacheng Li, and Julian McAuley. 2019. Justifying recommendations using distantly-labeled reviews and fine-grained aspects. In Proceedings of the 2019 conference on empirical methods in natural language processing and the 9th international joint conference on natural language processing (EMNLP-IJCNLP). 188–197.

[40] OpenAI. 2024. GPT-4 Technical Report. arXiv:2303.08774 [cs.CL] https://arxiv.org/abs/2303.08774

[41] Kishore Papineni, Salim Roukos, Todd Ward, and Wei-Jing Zhu. 2002. Bleu: a method for automatic evaluation of machine translation. In Proceedings of the 40th annual meeting of the Association for Computational Linguistics. 311–318.

[42] Chanwoo Park, Mingyang Liu, Dingwen Kong, Kaiqing Zhang, and Asuman Ozdaglar. 2024. Rlhf from heterogeneous feedback via personalization and preference aggregation. arXiv preprint arXiv:2405.00254 (2024).

[43] Qiyao Peng, Hongtao Liu, Hongyan Xu, Qing Yang, Minglai Shao, and Wenjun Wang. 2024. LLM: Harnessing Large Language Models for Personalized Review Generation. arXiv preprint arXiv:2407.07487 (2024).

[44] Sriyash Poddar, Yanming Wan, Hamish Ivison, Abhishek Gupta, and Natasha Jaques. 2024. Personalizing reinforcement learning from human feedback with variational preference learning. Advances in Neural Information Processing Systems 37 (2024), 52516–52544.

[45] Matt Post. 2018. A Call for Clarity in Reporting BLEU Scores. In Proceedings of the Third Conference on Machine Translation: Research Papers. 186–191.

[46] Yilun Qiu, Tianhao Shi, Xiaoyan Zhao, Fengbin Zhu, Yang Zhang, and Fuli Feng. 2025. Latent Inter-User Difference Modeling for LLM Personalization. arXiv:2507.20849 [cs.CL] https://arxiv.org/abs/2507.20849

[47] Yilun Qiu, Xiaoyan Zhao, Yang Zhang, Yimeng Bai, Wenjie Wang, Hong Cheng, Fuli Feng, and Tat-Seng Chua. 2025. Measuring What Makes You Unique: Difference-Aware User Modeling for Enhancing LLM Personalization. In Findings of the Association for Computational Linguistics: ACL 2025. Association for Computational Linguistics, 21258–21277.

[48] Nils Reimers and Iryna Gurevych. 2019. Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks. arXiv:1908.10084 [cs.CL] https://arxiv.org/abs/1908.10084

[49] Chris Richardson, Yao Zhang, Kellen Gillespie, Sudipta Kar, Arshdeep Singh, Zeynab Raeesy, Omar Zia Khan, and Abhinav Sethy. 2023. Integrating summarization and retrieval for enhanced personalization via large language models. arXiv preprint arXiv:2310.20081 (2023).

[50] Stephen Robertson, Hugo Zaragoza, et al. 2009. The probabilistic relevance framework: BM25 and beyond. Foundations and Trends® in Information Retrieval 3, 4 (2009), 333–389.

[51] Yuta Saito, Suguru Yaginuma, Yuta Nishino, Hayato Sakata, and Kazuhide Nakata. 2020. Unbiased Recommender Learning from Missing-Not-At-Random Implicit Feedback. In Proceedings of the 13th International Conference on Web Search and Data Mining (Houston, TX, USA) (WSDM ’20). Association for Computing Machinery, New York, NY, USA, 501–509. doi:10.1145/3336191.3371783

[53] Alireza Salemi, Surya Kallumadi, and Hamed Zamani. 2024. Optimization methods for personalizing large language models through retrieval augmentation. In Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval. 752–762.

[54] Alireza Salemi, Sheshera Mysore, Michael Bendersky, and Hamed Zamani. 2024. LaMP: When Large Language Models Meet Personalization. In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 7370–7392.

[55] Shibani Santurkar, Esin Durmus, Faisal Ladhak, Cinoo Lee, Percy Liang, and Tatsunori Hashimoto. 2023. Whose opinions do language models reflect? In International Conference on Machine Learning. PMLR, 29971–30004.

[56] Andrew I Schein, Alexandrin Popescul, Lyle H Ungar, and David M Pennock. 2002. Methods and metrics for cold-start recommendations. In Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval. 253–260.

[58] Teng Shi, Jun Xu, Xiao Zhang, Xiaoxue Zang, Kai Zheng, Yang Song, and Han Li. 2025. Retrieval augmented generation with collaborative filtering for personalized text generation. In Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval. 1294–1304.

[60] Kaitao Song, Xu Tan, Tao Qin, Jianfeng Lu, and Tie-Yan Liu. 2020. MPNet: Masked and Permuted Pre-training for Language Understanding. arXiv:2004.09297 [cs.CL] https://arxiv.org/abs/2004.09297

[61] Nisan Stiennon, Long Ouyang, Jeffrey Wu, Daniel Ziegler, Ryan Lowe, Chelsea Voss, Alec Radford, Dario Amodei, and Paul F Christiano. 2020. Learning to summarize with human feedback. Advances in Neural Information Processing Systems 33 (2020), 3008–3021.

[62] Chenkai Sun, Ke Yang, Revanth Gangi Reddy, Yi Fung, Hou Pong Chan, Kevin Small, ChengXiang Zhai, and Heng Ji. 2025. Persona-db: Efficient large language model personalization for response prediction with collaborative data refinement. In Proceedings of the 31st International Conference on Computational Linguistics. 281–296.

[63] Zhaoxuan Tan, Zheyuan Liu, and Meng Jiang. 2024. Personalized pieces: Efficient personalized large language models through collaborative efforts. arXiv preprint arXiv:2406.10471 (2024).

[64] Zhaoxuan Tan, Qingkai Zeng, Yijun Tian, Zheyuan Liu, Bing Yin, and Meng Jiang. 2024. Democratizing large language models via personalized parameter-efficient fine-tuning. arXiv preprint arXiv:2402.04401 (2024).

[66] Qwen Team. 2024. Qwen2.5: A Party of Foundation Models. https://qwenlm.github.io/blog/qwen2.5/

[67] Yu-Min Tseng, Yu-Chao Huang, Teng-Yun Hsiao, Wei-Lin Chen, Chao-Wei Huang, Yu Meng, and Yun-Nung Chen. 2024. Two Tales of Persona in LLMs: A Survey of Role-Playing and Personalization. In Findings of the Association for Computational Linguistics: EMNLP 2024, Yaser Al-Onaizan, Mohit Bansal, and Yun-Nung Chen (Eds.). Association for Computational Linguistics... doi:10.18653/v1/2024.findings-emnlp.969

[68] Hongru Wang, Wenyu Huang, Yang Deng, Rui Wang, Zezhong Wang, Yufei Wang, Fei Mi, Jeff Z Pan, and Kam-Fai Wong. 2024. Unims-rag: A unified multi-source retrieval-augmented generation for personalized dialogue systems. arXiv preprint arXiv:2401.13256 (2024).

[69] An Yang, Baosong Yang, Binyuan Hui, Bo Zheng, Bowen Yu, Chang Zhou, Chengpeng Li, Chengyuan Li, Dayiheng Liu, Fei Huang, Guanting Dong, Haoran Wei, Huan Lin, Jialong Tang, Jialin Wang, Jian Yang, Jianhong Tu, Jianwei Zhang, Jianxin Ma, Jin Xu, Jingren Zhou, Jinze Bai, Jinzheng He, Junyang Lin, Kai Dang, Keming Lu, Keqin Chen, Kexin Yang, Mei Li, Mingfen...

[70] Fan Yang, Zheng Chen, Ziyan Jiang, Eunah Cho, Xiaojiang Huang, and Yanbin Lu. 2023. Palr: Personalization aware llms for recommendation. arXiv preprint arXiv:2305.07622 (2023).

[71] Mert Yazan, Suzan Verberne, and Frederik Situmeang. 2025. Improving RAG for Personalization with Author Features and Contrastive Examples. In European Conference on Information Retrieval. Springer, 408–416.

[72] Saber Zerhoudi and Michael Granitzer. 2024. Personarag: Enhancing retrieval-augmented generation systems with user-centric agents. arXiv preprint arXiv:2407.09394 (2024).

[73] Jinghao Zhang, Yuting Liu, Wenjie Wang, Qiang Liu, Shu Wu, Liang Wang, and Tat-Seng Chua. 2025. Personalized Text Generation with Contrastive Activation Steering. In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics, 7128–7141.

[74] Kai Zhang, Yejin Kim, and Xiaozhong Liu. 2024. Personalized llm response generation with parameterized memory injection. arXiv preprint arXiv:2404.03565 (2024).

[75] Tianyi Zhang, Varsha Kishore, Felix Wu, Kilian Q. Weinberger, and Yoav Artzi. 2020. BERTScore: Evaluating Text Generation with BERT. In 8th International Conference on Learning Representations, ICLR 2020.

[77] Minjun Zhu, Yixuan Weng, Linyi Yang, and Yue Zhang. 2024. Personality alignment of large language models. arXiv preprint arXiv:2408.11779 (2024).

[78] Yuchen Zhuang, Haotian Sun, Yue Yu, Rushi Qiang, Qifan Wang, Chao Zhang, and Bo Dai. 2024. Hydra: Model factorization framework for black-box llm personalization. Advances in Neural Information Processing Systems 37 (2024), 100783–100815.