pith. sign in

arxiv: 2606.26403 · v1 · pith:DK5LAJPVnew · submitted 2026-06-24 · 💻 cs.CL

ProfileFoundry: A Synthetic Person-Object Substrate for Privacy, Memory, and Tool-Use Evaluation in LLM Agent

Pith reviewed 2026-06-26 01:19 UTC · model grok-4.3

classification 💻 cs.CL
keywords synthetic dataperson objectsLLM evaluationprivacymemoryagent statedata consistencysynthetic persons
0
0 comments X

The pith

ProfileFoundry supplies 100,000 synthetic person objects with enforced consistency for LLM evaluations.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper establishes ProfileFoundry as a deterministic generator and reference release of 100,000 synthetic adult person objects spanning eight locales. These objects incorporate consistent current snapshots, relational links to households, families, and employers, along with aligned events and provenance tracking. This setup addresses the challenge of obtaining shareable data for evaluating foundation models on tasks requiring personal context. Readers would care if it allows safe, controlled testing of how AI agents handle memory, privacy, and tool interactions with people.

Core claim

We present ProfileFoundry, a deterministic generator and fixed reference release of 100,000 adult synthetic Person Objects across eight locales. Each object combines a typed current snapshot, household, family, and employer links, snapshot-aligned events, normalized relational views, and generation provenance. The release contains 709,228 events, 40,338 households, 52,491 employers, and 518,564 directed relationship edges. We report evidence in separate categories: selected population-marginal comparisons, per-object invariant checks, release-wide referential and temporal closure, and coincidence/provenance screens. ProfileFoundry is a responsible synthetic source layer for constructing down

What carries the argument

The deterministic generator of synthetic Person Objects that enforces cross-field and temporal consistency through linked snapshots, events, and relationships.

If this is right

  • Supports evaluations of LLM agent memory using consistent personal histories and events.
  • Allows testing of privacy mechanisms with known synthetic person details.
  • Enables tool-use assessments in scenarios involving household, family, and employer links.
  • Provides a basis for record linkage and document understanding tasks with referential closure.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The dataset could support testing of long-term memory in agents by generating future updates aligned with existing events.
  • Researchers might use the provenance to trace how inconsistencies affect agent performance in controlled experiments.
  • Extensions could include generating documents or files linked to each person object for richer tool-use scenarios.

Load-bearing premise

The generator's checks produce cross-field and temporal consistency at a level that makes the objects usable for downstream LLM evaluations.

What would settle it

A test revealing frequent inconsistencies, such as mismatched ages with birth dates or events not aligning with household compositions, in a sample of the released objects.

Figures

Figures reproduced from arXiv: 2606.26403 by Anneswa Ghosh, Sriram Selvam.

Figure 1
Figure 1. Figure 1: A sample released person object from en-US locale. [PITH_FULL_IMAGE:figures/full_fig_p002_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Constraint influence graph, the full dependency map: which factor constrains [PITH_FULL_IMAGE:figures/full_fig_p005_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: Constrained cascade generation. PROFILEFOUNDRY carries constraints forward from reference tables and household plans into person fields, represented-link closure, snapshot-aligned temporal backfill, and export-time evidence checks. Household-first generation is an engineering design choice; without an ablation, this paper does not claim causal superiority over every alternative. Generator mechanics. The ge… view at source ↗
Figure 4
Figure 4. Figure 4: Age-gated constraint atlas for the en-US generator rules. Age is the master gate: [PITH_FULL_IMAGE:figures/full_fig_p013_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: Education to career tier to finance signature. Education indices gate which career [PITH_FULL_IMAGE:figures/full_fig_p013_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: Outlier policy: what is blocked, bent, or common. [PITH_FULL_IMAGE:figures/full_fig_p014_6.png] view at source ↗
Figure 7
Figure 7. Figure 7: Snapshot-aligned temporal backfill. Histories are reconstructed backward from [PITH_FULL_IMAGE:figures/full_fig_p014_7.png] view at source ↗
Figure 8
Figure 8. Figure 8: Release inventory and object topology. Raw row counts orient the reader on a log [PITH_FULL_IMAGE:figures/full_fig_p016_8.png] view at source ↗
Figure 9
Figure 9. Figure 9: Multi-surface coverage and per-profile density. Every profile carries a current [PITH_FULL_IMAGE:figures/full_fig_p018_9.png] view at source ↗
Figure 10
Figure 10. Figure 10: Households resolved into a directed relationship graph. Three quarters of the [PITH_FULL_IMAGE:figures/full_fig_p018_10.png] view at source ↗
Figure 11
Figure 11. Figure 11: Temporal release surface. Typed events project into address and employment [PITH_FULL_IMAGE:figures/full_fig_p019_11.png] view at source ↗
Figure 12
Figure 12. Figure 12: Employer context exported as resolvable entities rather than free-text names. [PITH_FULL_IMAGE:figures/full_fig_p019_12.png] view at source ↗
Figure 13
Figure 13. Figure 13: Claim-to-evidence ledger. Each headline capability of the release—structured, [PITH_FULL_IMAGE:figures/full_fig_p020_13.png] view at source ↗
Figure 14
Figure 14. Figure 14: Audit attachment map. Every generation stage carries its own validator family [PITH_FULL_IMAGE:figures/full_fig_p021_14.png] view at source ↗
Figure 15
Figure 15. Figure 15: Validation target audit: honest misses alongside invariant pass. Distributional fit [PITH_FULL_IMAGE:figures/full_fig_p022_15.png] view at source ↗
Figure 16
Figure 16. Figure 16: Release-wide referential and temporal closure. Relationship endpoints resolve to [PITH_FULL_IMAGE:figures/full_fig_p023_16.png] view at source ↗
Figure 17
Figure 17. Figure 17: Collision and coincidence screening. The release publishes exact within-release [PITH_FULL_IMAGE:figures/full_fig_p024_17.png] view at source ↗
Figure 18
Figure 18. Figure 18: Reproducibility pin. The release records the global seed, generation date, export [PITH_FULL_IMAGE:figures/full_fig_p025_18.png] view at source ↗
read the original abstract

Foundation-model research increasingly needs data about people: user state, personal histories, relationships, contact-like fields, documents, and longitudinal updates. Real user data is difficult to share, perturb, audit, or redistribute responsibly, while independently generated fake fields rarely preserve the cross-field and temporal consistency needed for controlled evaluation. We present PROFILEFOUNDRY, a deterministic generator and fixed reference release of 100,000 adult synthetic Person Objects across eight locales. Each object combines a typed current snapshot, household, family, and employer links, snapshot-aligned events, normalized relational views, and generation provenance. The release contains 709,228 events, 40,338 households, 52,491 employers, and 518,564 directed relationship edges. We report evidence in separate categories: selected population-marginal comparisons, per-object invariant checks, release-wide referential and temporal closure, and coincidence/provenance screens. PROFILEFOUNDRY is not a population-fidelity model, a rendered-text corpus, or a formal privacy mechanism. Instead, it is a responsible synthetic source layer for constructing downstream foundation-model evaluations involving memory, privacy, document understanding, record linkage, and agent state while keeping the synthetic person behind each artifact inspectable

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

0 major / 3 minor

Summary. The manuscript presents PROFILEFOUNDRY, a deterministic generator and fixed reference release of 100,000 adult synthetic Person Objects across eight locales. Each object includes a typed current snapshot, household/family/employer links, snapshot-aligned events, normalized relational views, and generation provenance. The release contains 709,228 events, 40,338 households, 52,491 employers, and 518,564 directed relationship edges. Evidence for consistency and closure is reported in four categories: population-marginal comparisons, per-object invariant checks, release-wide referential/temporal closure, and coincidence/provenance screens. The resource is positioned as a synthetic substrate for downstream LLM agent evaluations involving memory, privacy, document understanding, record linkage, and tool use, with the synthetic persons remaining inspectable.

Significance. If the reported checks establish the claimed consistency at a usable level, this fixed release supplies a reproducible, auditable, and shareable person-like substrate that directly addresses privacy barriers in foundation-model research. The deterministic generation, explicit provenance, and multi-category verification approach enable controlled, falsifiable experiments that are difficult to conduct with real user data. The emphasis on inspectability and separation from both population-fidelity modeling and formal privacy mechanisms is a constructive contribution to evaluation infrastructure.

minor comments (3)
  1. [Abstract] Abstract: The abstract asserts that evidence is supplied across four explicit categories but does not include any quantitative outcomes, example statistics, or table references from those checks. Adding at least one representative metric per category would make the summary self-contained.
  2. The manuscript states that PROFILEFOUNDRY is not a population-fidelity model, yet no brief comparison to existing synthetic person or household generators appears in the related-work discussion. A short paragraph situating the generator relative to prior work would clarify its distinctive properties.
  3. The release counts (events, households, employers, edges) are given, but the manuscript does not indicate the exact public artifact location, file formats, or licensing terms. Explicit pointers and a one-paragraph usage note would improve immediate adoptability.

Simulated Author's Rebuttal

0 responses · 0 unresolved

We thank the referee for their accurate summary of ProfileFoundry and for the positive evaluation of its potential contribution as a synthetic evaluation substrate. The recommendation of minor revision is noted; we will prepare a revised manuscript once any specific editorial or minor suggestions are provided.

Circularity Check

0 steps flagged

No significant circularity

full rationale

The paper describes a deterministic generator and fixed release of 100k synthetic Person Objects, supported by four categories of reported checks (population-marginal comparisons, per-object invariants, referential/temporal closure, coincidence/provenance screens). No equations, parameters, derivations, or load-bearing self-citations appear. The consistency claim is presented as directly testable from the released artifact rather than reduced to any internal fit or prior self-referential result. This matches the default expectation of a non-circular data-generation paper.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 1 invented entities

The generator depends on internal consistency rules and locale-specific parameters whose details are not supplied in the abstract; the Person Object is the core invented structure.

axioms (1)
  • domain assumption Synthetic data can be constructed to maintain cross-field and temporal consistency sufficient for controlled LLM evaluations
    Invoked in abstract paragraph 2 as the key requirement that prior fake data fails to meet.
invented entities (1)
  • Person Object no independent evidence
    purpose: Structured synthetic representation combining snapshot, links, events, and provenance
    Central output of the generator; no external falsifiable validation beyond internal checks is described.

pith-pipeline@v0.9.1-grok · 5752 in / 1490 out tokens · 30669 ms · 2026-06-26T01:19:00.233820+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

37 extracted references · 3 canonical work pages · 1 internal anchor

  1. [1]

    28th USENIX Security Symposium , pages =

    The Secret Sharer: Evaluating and Testing Unintended Memorization in Neural Networks , author =. 28th USENIX Security Symposium , pages =. 2019 , url =

  2. [2]

    30th USENIX Security Symposium , pages =

    Extracting Training Data from Large Language Models , author =. 30th USENIX Security Symposium , pages =. 2021 , url =

  3. [3]

    Christen, Peter , booktitle =

  4. [4]

    2025 , url =

    Faker: Python Package That Generates Fake Data , author =. 2025 , url =

  5. [5]

    2024 , eprint =

    Scaling Synthetic Data Creation with 1,000,000,000 Personas , author =. 2024 , eprint =

  6. [6]

    Communications of the ACM , volume =

    Datasheets for Datasets , author =. Communications of the ACM , volume =. 2021 , doi =

  7. [7]

    2024 , howpublished =

    Synthetic. 2024 , howpublished =

  8. [8]

    Gates Open Research , volume =

    Simulated Data for Census-Scale Entity Resolution Research Without Privacy Restrictions: A Large-Scale Dataset Generated by Individual-Based Modeling , author =. Gates Open Research , volume =. 2024 , doi =

  9. [9]
  10. [10]

    Know Me, Respond to Me: Benchmarking

    Jiang, Bowen and Hao, Zhuoqun and Cho, Young Min and Li, Bryan and Yuan, Yuan and Chen, Sihao and Ungar, Lyle and Taylor, Camillo Jose and Roth, Dan , booktitle =. Know Me, Respond to Me: Benchmarking. 2025 , url =

  11. [11]

    Edward and Oh, Sewoong and Tsvetkov, Yulia and Koh, Pang Wei and Choi, Yejin , year =

    Kim, Hyunwoo and Mireshghallah, Niloofar and Duan, Michael and Xin, Rui and Li, Shuyue Stella and Jung, Jaehun and Acuna, David and Pang, Qi and Xiao, Hanshen and Suh, G. Edward and Oh, Sewoong and Tsvetkov, Yulia and Koh, Pang Wei and Choi, Yejin , year =. 2602.03183 , archivePrefix =

  12. [12]

    Evaluating Very Long-Term Conversational Memory of

    Maharana, Adyasha and Lee, Dong-Ho and Tulyakov, Sergey and Bansal, Mohit and Barbieri, Francesco and Fang, Yuwei , booktitle =. Evaluating Very Long-Term Conversational Memory of. 2024 , doi =

  13. [13]

    2025 , url =

    Mimesis: Fake Data Generator , author =. 2025 , url =

  14. [14]

    Proceedings of the Conference on Fairness, Accountability, and Transparency , pages =

    Model Cards for Model Reporting , author =. Proceedings of the Conference on Fairness, Accountability, and Transparency , pages =. 2019 , doi =

  15. [15]

    and Dibben, Chris , journal =

    Nowok, Beata and Raab, Gillian M. and Dibben, Chris , journal =. 2016 , doi =

  16. [16]

    2025 , howpublished =

  17. [17]

    Proceedings of the 36th Annual ACM Symposium on User Interface Software and Technology , year =

    Generative Agents: Interactive Simulacra of Human Behavior , author =. Proceedings of the 36th Annual ACM Symposium on User Interface Software and Technology , year =

  18. [18]

    2016 IEEE International Conference on Data Science and Advanced Analytics , pages =

    The Synthetic Data Vault , author =. 2016 IEEE International Conference on Data Science and Advanced Analytics , pages =. 2016 , doi =

  19. [19]

    Data Cards: Purposeful and Transparent Dataset Documentation for Responsible

    Pushkarna, Mahima and Zaldivar, Andrew and Kjartansson, Oddur , journal =. Data Cards: Purposeful and Transparent Dataset Documentation for Responsible. 2022 , doi =

  20. [20]

    2025 , url =

    Pydantic: Data Validation Using Python Type Hints , author =. 2025 , url =

  21. [21]

    , year =

    Eastlake, Donald and Panitz, A. , year =

  22. [22]

    2024 , url =

    Salemi, Alireza and Mysore, Sheshera and Bendersky, Michael and Zamani, Hamed , booktitle =. 2024 , url =

  23. [23]

    2025 , doi =

    Savkin, Maksim and Ionov, Timur and Konovalov, Vasily , booktitle =. 2025 , doi =

  24. [24]

    2025 , url =

    Selvam, Sriram and Ghosh, Anneswa , journal =. 2025 , url =

  25. [25]

    International Conference on Learning Representations , year =

    Beyond Memorization: Violating Privacy via Inference with Large Language Models , author =. International Conference on Learning Representations , year =

  26. [26]

    2502.20616 , archivePrefix =

    Tan, Juntao and Yang, Liangwei and Liu, Zuxin and Liu, Zhiwei and Murthy, Rithesh and Awalgaonkar, Tulika Manoj and Zhang, Jianguo and Yao, Weiran and Zhu, Ming and Kokane, Shirley and Savarese, Silvio and Wang, Huan and Xiong, Caiming and Heinecke, Shelby , year =. 2502.20616 , archivePrefix =

  27. [27]

    2013 , doi =

    Tran, Khoi-Nguyen and Vatsalan, Dinusha and Christen, Peter , booktitle =. 2013 , doi =

  28. [28]

    Communications of the ACM , volume =

    Wikidata: A Free Collaborative Knowledgebase , author =. Communications of the ACM , volume =. 2014 , doi =

  29. [29]

    2018 , doi =

    Walonoski, Jason and Kramer, Mark and Nichols, Joseph and Quina, Andre and Moesel, Chris and Hall, Dylan and Duffett, Carlton and Dube, Kudakwashe and Gallagher, Thomas and McLachlan, Scott , journal =. 2018 , doi =

  30. [30]

    2025 , url =

    Wu, Di and Wang, Hongwei and Yu, Wenhao and Zhang, Yuwei and Chang, Kai-Wei and Yu, Dong , booktitle =. 2025 , url =

  31. [31]

    Advances in Neural Information Processing Systems , year =

    A Synthetic Dataset for Personal Attribute Inference , author =. Advances in Neural Information Processing Systems , year =

  32. [32]

    and Srivastava, Divesh and Xiao, Xiaokui , journal =

    Zhang, Jun and Cormode, Graham and Procopiuc, Cecilia M. and Srivastava, Divesh and Xiao, Xiaokui , journal =. 2017 , doi =

  33. [33]

    Personalizing Dialogue Agents:

    Zhang, Saizheng and Dinan, Emily and Urbanek, Jack and Szlam, Arthur and Kiela, Douwe and Weston, Jason , booktitle =. Personalizing Dialogue Agents:. 2018 , doi =

  34. [34]

    2021 , url =

    Zhang, Zhikun and Wang, Tianhao and Li, Ninghui and Honorio, Jean and Backes, Michael and He, Shibo and Chen, Jiming and Zhang, Yang , booktitle =. 2021 , url =

  35. [35]

    2026 , url =

    Mimesis Schema: Foreign-Key References and Structured Generation , author =. 2026 , url =

  36. [36]

    2026 , url =

    Synthetic Data Vault Documentation: Single-Table, Multi-Table, and Sequential Data , author =. 2026 , url =

  37. [37]

    2026 , url =

    pseudopeople Documentation , author =. 2026 , url =