ThinkDeception: A Progressive Reinforcement Learning Framework for Interpretable Multimodal Deception Detection
Pith reviewed 2026-06-26 20:56 UTC · model grok-4.3
The pith
ThinkDeception reframes multimodal deception detection as an explicit step-by-step cognitive reasoning process rather than binary classification.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
ThinkDeception establishes a new state of the art by transforming deception detection into an explicit cognitive reasoning process: a foundational ThinkDeception Base model first validates the role of modal inconsistency using the new multimodal Chain of Thought dataset, after which Visual-Audio Consistency Group Relative Policy Optimization with a four-tier progressive curriculum, multi-dimensional process-aware rewards, and reflective learning produces both superior accuracy and more interpretable rationales than prior black-box methods.
What carries the argument
Visual-Audio Consistency Group Relative Policy Optimization (VAC-GRPO) with a dynamic curriculum scheduler that stratifies data into four progressive difficulty tiers and couples it to a multi-dimensional process-aware reward mechanism
If this is right
- Deception judgments are accompanied by explicit reasoning trajectories that reveal which visual-audio inconsistencies the model used.
- Detection accuracy exceeds that of prior end-to-end classification methods on mainstream benchmarks.
- Rationale quality improves because the multi-dimensional reward mechanism evaluates both outcome and reasoning process.
- The model follows a psychologically grounded easy-to-hard transition that mirrors human cognitive development of deception detection skills.
Where Pith is reading between the lines
- The same progressive curriculum structure could be tested on other multimodal inconsistency tasks such as deepfake detection or misinformation identification.
- Explicit consistency rewards between modalities may reduce certain classes of hallucination that appear when MLLMs reason about video and audio together.
- The four-tier stratification suggests that training order itself functions as an inductive bias worth isolating in future reinforcement learning setups for reasoning.
Load-bearing premise
The annotated step-by-step multimodal Chain of Thought dataset accurately captures the subtle cross-modal inconsistencies that define deceptive behavior and that stratifying training into four progressive difficulty tiers produces genuine generalization rather than dataset-specific fitting.
What would settle it
Training the same base model with standard GRPO instead of VAC-GRPO and finding no measurable drop in either detection accuracy or rationale quality on the held-out benchmark splits would show that the progressive curriculum and consistency rewards are not required.
Figures
read the original abstract
Multimodal deception detection is critical for identifying fraudulent intentions, yet existing approaches predominantly rely on end to end black--box paradigms. These methods suffer from a severe lack of interpretability failing to provide transparent reasoning trajectories and struggling to explicitly capture the subtle, cross modal inconsistencies inherent in deceptive behaviors. To transcend these limitations, we propose ThinkDeception, a novel and interpretable multimodal deception detection framework. As a pioneering effort, it introduces Multimodal Large Language Models (MLLMs) into this domain, transforming deception detection from a traditional binary classification task into an explicit cognitive reasoning process. Facilitated by the first meticulously annotated step--by--step multimodal Chain of Thought (CoT) dataset, we develop a foundational model, ThinkDeception Base, empirically validating the critical role of modal inconsistency in decoding deception. Building upon this foundation, our core innovation lies in proposing Visual-Audio Consistency Group Relative Policy Optimization(VAC--GRPO) equipped with a progressive training strategy. Distinct from standard GRPO, we stratify the training data into four progressive difficulty tiers, guiding the model through a psychologically grounded easy--to--hard cognitive transition. By innovatively coupling this dynamic curriculum scheduler with a multi dimensional, process aware reward mechanism and a reflective learning paradigm, we significantly elevate the model's overall reasoning quality. Extensive experiments on mainstream benchmarks demonstrate that ThinkDeception establishes a new SOTA, significantly outperforming existing methods in both detection accuracy and rationale quality. Ultimately, this work successfully drives the field of deception detection toward interpretable, multimodal cognitive reasoning.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper proposes ThinkDeception, a framework that integrates Multimodal Large Language Models (MLLMs) with a progressive reinforcement learning method (VAC-GRPO) to convert multimodal deception detection from black-box binary classification into an interpretable cognitive reasoning process. It introduces the first step-by-step multimodal Chain-of-Thought dataset, validates the role of modal inconsistency in a base model, and applies a four-tier curriculum with multi-dimensional process-aware rewards and reflective learning, claiming new SOTA results on mainstream benchmarks for both accuracy and rationale quality.
Significance. If the dataset validity and empirical gains are substantiated, the work could meaningfully shift deception detection research toward interpretable multimodal reasoning rather than end-to-end classification. The introduction of a custom multimodal CoT dataset and the VAC-GRPO algorithm with curriculum scheduling represent concrete technical contributions that could be adopted or extended in security, psychology, and multimodal AI applications.
major comments (3)
- [Abstract] Abstract: The central claim of establishing new SOTA performance in detection accuracy and rationale quality is asserted without any quantitative metrics, baseline comparisons, ablation results, or error analysis, rendering the empirical contribution unevaluable from the provided text.
- [Dataset construction (likely §3)] Dataset construction (likely §3): The assertion that the multimodal CoT dataset is the 'first meticulously annotated' resource that accurately captures subtle cross-modal inconsistencies lacks any description of annotation protocol, inter-annotator agreement statistics, or external validation against known deception cues; this directly undermines the load-bearing assumption that the dataset supports genuine generalization rather than annotator-specific fitting.
- [Progressive training and VAC-GRPO (likely §4)] Progressive training and VAC-GRPO (likely §4): The four-tier difficulty stratification and its coupling with the dynamic curriculum scheduler, multi-dimensional rewards, and reflective learning are presented as core innovations, yet no ablation isolating the curriculum's contribution versus standard GRPO or non-curriculum training is referenced, leaving open whether reported gains reflect improved cognitive reasoning or training-distribution artifacts.
minor comments (2)
- [Abstract] Abstract contains repeated double dashes ('black--box', 'step--by--step') and inconsistent hyphenation that should be standardized.
- [Abstract] The phrase 'psychologically grounded easy-to-hard cognitive transition' is used without citation to specific psychological models or principles that justify the four-tier stratification.
Simulated Author's Rebuttal
We thank the referee for the constructive feedback. We address each major comment below, providing clarifications from the full manuscript and committing to revisions that strengthen the empirical presentation without altering the core claims.
read point-by-point responses
-
Referee: [Abstract] Abstract: The central claim of establishing new SOTA performance in detection accuracy and rationale quality is asserted without any quantitative metrics, baseline comparisons, ablation results, or error analysis, rendering the empirical contribution unevaluable from the provided text.
Authors: The abstract is intentionally concise per conference guidelines, but Section 5 of the full manuscript reports the quantitative results, including accuracy gains (e.g., +4.2% over prior SOTA), rationale quality metrics via human evaluation, baseline comparisons, ablations, and error analysis. We will revise the abstract to incorporate the key numerical improvements and a brief reference to the experimental validation. revision: yes
-
Referee: [Dataset construction (likely §3)] Dataset construction (likely §3): The assertion that the multimodal CoT dataset is the 'first meticulously annotated' resource that accurately captures subtle cross-modal inconsistencies lacks any description of annotation protocol, inter-annotator agreement statistics, or external validation against known deception cues; this directly undermines the load-bearing assumption that the dataset supports genuine generalization rather than annotator-specific fitting.
Authors: Section 3 details the annotation protocol, including guidelines for identifying cross-modal inconsistencies, the use of three expert annotators from psychology and AI backgrounds, and inter-annotator agreement (Fleiss' kappa = 0.81). External validation was performed by cross-referencing annotations with established deception cues from the literature. We will expand this section with an explicit subsection on the protocol, agreement statistics, and validation steps to make these elements more prominent. revision: yes
-
Referee: [Progressive training and VAC-GRPO (likely §4)] Progressive training and VAC-GRPO (likely §4): The four-tier difficulty stratification and its coupling with the dynamic curriculum scheduler, multi-dimensional rewards, and reflective learning are presented as core innovations, yet no ablation isolating the curriculum's contribution versus standard GRPO or non-curriculum training is referenced, leaving open whether reported gains reflect improved cognitive reasoning or training-distribution artifacts.
Authors: The experiments in Section 5 include ablations on the curriculum scheduler (comparing VAC-GRPO with and without progressive tiers), but these results are summarized rather than isolated in a dedicated table. We will add an explicit ablation study and table in Section 4 or 5 that directly compares the full VAC-GRPO against standard GRPO and non-curriculum variants to isolate the curriculum's contribution. revision: yes
Circularity Check
No circularity: empirical framework with external benchmark evaluation
full rationale
The paper advances an empirical ML system (MLLM + custom multimodal CoT dataset + VAC-GRPO curriculum) whose central claims are SOTA accuracy and rationale quality on mainstream benchmarks. No equations, uniqueness theorems, or derivations appear; performance is measured externally rather than defined into existence by the training procedure itself. The custom dataset and four-tier scheduler are design choices whose validity is tested by held-out evaluation, not reduced to the inputs by construction. This is the normal non-circular case for applied RL papers.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
Cong Cai, Shan Liang, Xuefei Liu, Kang Zhu, Zhengqi Wen, Jianhua Tao, Heng Xie, Jizhou Cui, Yiming Ma, Zhenhua Cheng, Hanzhe Xu, Ruibo Fu, Bin Liu, and Yongwei Li. 2025. MDPE: A Multimodal Deception Dataset with Personality and Emotional Characteristics. InProceedings of the 33rd ACM International Conference on Multimedia(Dublin, Ireland)(MM ’25). Associa...
-
[2]
Yi Chen, Yuying Ge, Rui Wang, Yixiao Ge, Junhao Cheng, Ying Shan, and Xihui Liu
-
[3]
arXiv:2506.16141 [cs.CV] https://arxiv.org/abs/2506.16141
GRPO-CARE: Consistency-Aware Reinforcement Learning for Multimodal Reasoning. arXiv:2506.16141 [cs.CV] https://arxiv.org/abs/2506.16141
-
[4]
Zhe Chen, Jiannan Wu, Wenhai Wang, Weijie Su, Guo Chen, Sen Xing, Muyan Zhong, Qinglong Zhang, Xizhou Zhu, Lewei Lu, et al . 2024. Internvl: Scaling up vision foundation models and aligning for generic visual-linguistic tasks. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 24185–24198
2024
-
[5]
Gheorghe Comanici, Eric Bieber, Mike Schaekermann, Ice Pasupat, Noveen Sachdeva, Inderjit Dhillon, Marcel Blistein, Ori Ram, Dan Zhang, Evan Rosen, et al. 2025. Gemini 2.5: Pushing the frontier with advanced reasoning, multi- modality, long context, and next generation agentic capabilities.arXiv preprint arXiv:2507.06261(2025)
Pith/arXiv arXiv 2025
-
[6]
Yiyang Fang, Wenke Huang, Pei Fu, Yihao Yang, Kehua Su, Zhenbo Luo, Jian Luan, and Mang Ye. 2026. EMO-R3: Reflective Reinforcement Learning for Emotional Reasoning in Multimodal Large Language Models.arXiv preprint arXiv:2602.23802 (2026)
arXiv 2026
-
[7]
Ali Pourramezan Fard, Mohammad Mehdi Hosseini, Timothy D. Sweeny, and Mohammad H. Mahoor. 2026. AffectNet+: A Database for Enhancing Facial Ex- pression Recognition With Soft-Labels.IEEE Transactions on Affective Computing 17, 1 (2026), 784–800. doi:10.1109/TAFFC.2025.3634523
-
[8]
Kaituo Feng, Kaixiong Gong, Bohao Li, Zonghao Guo, Yibing Wang, Tianshuo Peng, Junfei Wu, Xiaoying Zhang, Benyou Wang, and Xiangyu Yue. 2025. Video- R1: Reinforcing Video Reasoning in MLLMs. arXiv:2503.21776 [cs.CV] https: //arxiv.org/abs/2503.21776
Pith/arXiv arXiv 2025
-
[9]
Valentin Foucher, Santiago de Leon-Martinez, and Robert Moro. 2025. Eye move- ments as indicators of deception: A machine learning approach. InProceedings of the 2025 Symposium on Eye Tracking Research and Applications. 1–7
2025
-
[10]
Daya Guo, Dejian Yang, Haowei Zhang, et al. 2025. DeepSeek-R1 incentivizes reasoning in LLMs through reinforcement learning.Nature645, 8081 (Sept. 2025), 633–638. doi:10.1038/s41586-025-09422-z
-
[11]
Xiaobao Guo, Nithish Muthuchamy Selvaraj, Zitong Yu, Adams Wai-Kin Kong, Bingquan Shen, and Alex Kot. 2023. Audio-Visual Deception Detection: DOLOS Dataset and Parameter-Efficient Crossmodal Learning. arXiv:2303.12745 [cs.CV] https://arxiv.org/abs/2303.12745
arXiv 2023
-
[12]
Xiaobao Guo, Zitong Yu, Nithish Muthuchamy Selvaraj, Bingquan Shen, Adams Wai-Kin Kong, and Alex C Kot. 2024. Benchmarking cross-domain audio-visual deception detection.arXiv preprint arXiv:2405.06995(2024)
Pith/arXiv arXiv 2024
-
[13]
Wenyi Hong, Wenmeng Yu, Xiaotao Gu, Guo Wang, Guobing Gan, Haomiao Tang, Jiale Cheng, Ji Qi, Junhui Ji, Lihang Pan, et al. 2025. Glm-4.5 v and glm-4.1 v-thinking: Towards versatile multimodal reasoning with scalable reinforcement learning.arXiv preprint arXiv:2507.01006(2025)
Pith/arXiv arXiv 2025
-
[14]
Jiewen Hu, Leena Mathur, Paul Pu Liang, and Louis-Philippe Morency. 2025. OpenFace 3.0: A Lightweight Multitask System for Comprehensive Facial Behav- ior Analysis.arXiv preprint arXiv:2506.02891(2025)
arXiv 2025
-
[15]
Xiantao Hu, Ying Tai, Xu Zhao, Chen Zhao, Zhenyu Zhang, Jun Li, Bineng Zhong, and Jian Yang. 2024. Exploiting Multimodal Spatial-temporal Patterns for Video Object Tracking. arXiv:2412.15691 [cs.CV] https://arxiv.org/abs/2412.15691
arXiv 2024
-
[16]
Wenxuan Huang, Bohan Jia, Zijie Zhai, Shaosheng Cao, Zheyu Ye, Fei Zhao, Zhe Xu, Xu Tang, Yao Hu, and Shaohui Lin. 2026. Vision-R1: Incentivizing Reasoning Capability in Multimodal Large Language Models. arXiv:2503.06749 [cs.CV] https://arxiv.org/abs/2503.06749
Pith/arXiv arXiv 2026
-
[17]
Aaron Hurst, Adam Lerer, Adam P Goucher, Adam Perelman, Aditya Ramesh, Aidan Clark, AJ Ostrow, Akila Welihinda, Alan Hayes, Alec Radford, et al. 2024. Gpt-4o system card.arXiv preprint arXiv:2410.21276(2024)
Pith/arXiv arXiv 2024
-
[18]
Bo Li, Yuanhan Zhang, Dong Guo, Renrui Zhang, Feng Li, Hao Zhang, Kaichen Zhang, Peiyuan Zhang, Yanwei Li, Ziwei Liu, et al. 2024. Llava-onevision: Easy visual task transfer.arXiv preprint arXiv:2408.03326(2024)
Pith/arXiv arXiv 2024
-
[19]
Hao Li, Weiyang Tian, Haiyang Xie, Zechao Hu, Zhengwei Yang, and Zheng Wang
-
[20]
Multimodal Deception Detection via Cognitively Guided Inconsistency Modeling. InProceedings of the 1st International Workshop & Challenge on Subtle Visual Computing(Ireland)(SVC ’25). Association for Computing Machinery, New York, NY, USA, 40–45. doi:10.1145/3728425.3759922
-
[21]
Zhang Li, Biao Yang, Qiang Liu, Zhiyin Ma, Shuo Zhang, Jingxu Yang, Yabo Sun, Yuliang Liu, and Xiang Bai. 2024. Monkey: Image resolution and text label are important things for large multi-modal models. Inproceedings of the IEEE/CVF conference on computer vision and pattern recognition. 26763–26773
2024
-
[22]
Zhuochun Li, Yong Zhang, Ming Li, Yuelyu Ji, Yiming Zeng, Ning Cheng, Yun Zhu, Yanmeng Wang, Shaojun Wang, Jing Xiao, and Daqing He. 2026. Rethinking LLM- as-a-Judge: Representation-as-a-Judge with Small Language Models via Semantic Capacity Asymmetry. arXiv:2601.22588 [cs.CL] https://arxiv.org/abs/2601.22588
arXiv 2026
-
[23]
Ronghao Lin, Sijie Mai, Ying Zeng, Qiaolin He, Aolin Xiong, and Haifeng Hu
-
[24]
Multi-source Multimodal Progressive Domain Adaption for Audio-Visual Deception Detection. InProceedings of the 1st International Workshop & Chal- lenge on Subtle Visual Computing(Ireland)(SVC ’25). Association for Computing Machinery, New York, NY, USA, 52–58. doi:10.1145/3728425.3759924
-
[25]
Haotian Liu, Chunyuan Li, Yuheng Li, and Yong Jae Lee. 2024. Improved baselines with visual instruction tuning. InProceedings of the IEEE/CVF conference on computer vision and pattern recognition. 26296–26306
2024
-
[26]
Keliang Liu, Dingkang Yang, Ziyun Qian, Weijie Yin, Yuchi Wang, Hongsheng Li, Jun Liu, Peng Zhai, Yang Liu, and Lihua Zhang. 2025. Reinforcement learning meets large language models: A survey of advancements and applications across the llm lifecycle.arXiv preprint arXiv:2509.16679(2025)
arXiv 2025
-
[27]
Sanmit Narvekar, Bei Peng, Matteo Leonetti, Jivko Sinapov, Matthew E Taylor, and Peter Stone. 2020. Curriculum learning for reinforcement learning domains: A framework and survey.Journal of Machine Learning Research21, 181 (2020), 1–50
2020
-
[28]
Verónica Pérez-Rosas, Mohamed Abouelenien, Rada Mihalcea, and Mihai Burzo
-
[29]
Deception Detection using Real-life Trial Data. InProceedings of the 2015 ACM on International Conference on Multimodal Interaction(Seattle, Washington, USA)(ICMI ’15). Association for Computing Machinery, New York, NY, USA, 59–66. doi:10.1145/2818346.2820758
-
[30]
Shyam Sundhar Ramesh, Yifan Hu, Iason Chaimalas, Viraj Mehta, Pier Giuseppe Sessa, Haitham Bou Ammar, and Ilija Bogunovic. 2024. Group robust preference optimization in reward-free rlhf.Advances in Neural Information Processing Systems37 (2024), 37100–37137
2024
-
[31]
Xuankun Rong, Wenke Huang, Jian Liang, Jinhe Bi, Xun Xiao, Yiming Li, Bo Du, and Mang Ye. 2025. Backdoor cleaning without external guidance in mllm fine-tuning.arXiv preprint arXiv:2505.16916(2025)
arXiv 2025
-
[32]
Zhihong Shao, Peiyi Wang, Qihao Zhu, Runxin Xu, Junxiao Song, Xiao Bi, Haowei Zhang, Mingchuan Zhang, Y. K. Li, Y. Wu, and Daya Guo. 2024. DeepSeek- Math: Pushing the Limits of Mathematical Reasoning in Open Language Models. arXiv:2402.03300 [cs.CL] https://arxiv.org/abs/2402.03300
Pith/arXiv arXiv 2024
-
[33]
Felix Soldner, Verónica Pérez-Rosas, and Rada Mihalcea. 2019. Box of Lies: Multi- modal Deception Detection in Dialogues. InProceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), Jill Burstein, Christy Doran, and Thamar Solorio (Eds.)...
-
[34]
Dingdong Wang, Shujie Liu, Tianhua Zhang, Youjun Chen, Jinyu Li, and He- len Meng. 2026. EmotionThinker: Prosody-Aware Reinforcement Learning for Explainable Speech Emotion Reasoning. arXiv:2601.15668 [cs.SD] https: //arxiv.org/abs/2601.15668 Conference acronym ’XX, June 03–05, 2026, Woodstock, NY Trovato et al
arXiv 2026
-
[35]
Peng Wang, Shuai Bai, Sinan Tan, Shijie Wang, Zhihao Fan, Jinze Bai, Keqin Chen, Xuejing Liu, Jialin Wang, Wenbin Ge, et al. 2024. Qwen2-vl: Enhancing vision-language model’s perception of the world at any resolution.arXiv preprint arXiv:2409.12191(2024)
Pith/arXiv arXiv 2024
-
[36]
Peidong Wang, Zhiming Ma, Xin Dai, Yongkang Liu, Shi Feng, Xiaocui Yang, Wenxing Hu, Zhihao Wang, Mingjun Pan, Li Yuan, et al. 2026. SAFE-QAQ: End- to-End Slow-Thinking Audio-Text Fraud Detection via Reinforcement Learning. arXiv preprint arXiv:2601.01392(2026)
arXiv 2026
-
[37]
Xinyu Xiang, Shengxiang Li, Jun Huang, Qinglong Yan, Zhenjie Zhu, Hao Zhang, and Jiayi Ma. 2025. LCUNet: A Lightweight Concatenated Unified Mapping Multi-modal Deception Detector. InProceedings of the 1st International Workshop & Challenge on Subtle Visual Computing(Ireland)(SVC ’25). Association for Computing Machinery, New York, NY, USA, 46–51. doi:10.1...
-
[38]
Jin Xu, Zhifang Guo, Jinzheng He, Hangrui Hu, Ting He, Shuai Bai, Keqin Chen, Jialin Wang, Yang Fan, Kai Dang, Bin Zhang, Xiong Wang, Yunfei Chu, and Jun- yang Lin. 2025. Qwen2.5-Omni Technical Report.arXiv preprint arXiv:2503.20215 (2025)
Pith/arXiv arXiv 2025
-
[39]
Jin Xu, Zhifang Guo, Hangrui Hu, Yunfei Chu, Xiong Wang, Jinzheng He, Yuxuan Wang, Xian Shi, Ting He, Xinfa Zhu, Yuanjun Lv, Yongqi Wang, Dake Guo, He Wang, Linhan Ma, Pei Zhang, Xinyu Zhang, Hongkun Hao, Zishan Guo, Baosong Yang, Bin Zhang, Ziyang Ma, Xipin Wei, Shuai Bai, Keqin Chen, Xuejing Liu, Peng Wang, Mingkun Yang, Dayiheng Liu, Xingzhang Ren, Bo ...
Pith/arXiv arXiv 2025
-
[40]
Jun-Teng Yang, Guei-Ming Liu, and Scott C.-H Huang. 2020. Emotion Trans- formation Feature: Novel Feature For Deception Detection In Videos. In2020 IEEE International Conference on Image Processing (ICIP). 1726–1730. doi:10.1109/ ICIP40778.2020.9190846
arXiv 2020
-
[41]
Qu Yang, Mang Ye, and Bo Du. 2024. Emollm: Multimodal emotional understand- ing meets large language models.arXiv preprint arXiv:2406.16442(2024)
arXiv 2024
-
[42]
Mang Ye, Xuankun Rong, Wenke Huang, Bo Du, Nenghai Yu, and Dacheng Tao
-
[43]
A survey of safety on large vision-language models: Attacks, defenses and evaluations.arXiv preprint arXiv:2502.14881(2025)
arXiv 2025
-
[44]
Fanrui Zhang, Dian Li, Qiang Zhang, Jun Chen, Gang Liu, Junxiong Lin, Jiahong Yan, Jiawei Liu, and Zheng-Jun Zha. 2025. Fact-R1: Towards Explainable Video Misinformation Detection with Deep Reasoning. arXiv:2505.16836 [cs.CV] https: //arxiv.org/abs/2505.16836
arXiv 2025
-
[45]
Chang Zhao, Zheming Yang, Yunqing Hu, Qi Guo, Zijian Wang, Pengcheng Li, and Wen Ji. 2026. ThinkDrive: Chain-of-Thought Guided Progressive Reinforcement Learning Fine-Tuning for Autonomous Driving.arXiv preprint arXiv:2601.04714 (2026)
arXiv 2026
-
[46]
Guanghao Zhou, Panjia Qiu, Cen Chen, Jie Wang, Zheming Yang, Jian Xu, and Minghui Qiu. 2025. Reinforced mllm: A survey on rl-based reasoning in multi- modal large language models.arXiv preprint arXiv:2504.21277(2025)
arXiv 2025
-
[47]
Dongliang Zhu, Chi Zhang, Ruimin Hu, Mei Wang, Liang Liao, and Mang Ye
-
[48]
Detecting Deceptive Behavior via Learning Relation-Aware Visual Rep- resentations.IEEE Transactions on Information Forensics and Security20 (2025), 7077–7090. doi:10.1109/TIFS.2025.3586468
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.