{"record_type":"pith_number_record","schema_url":"https://pith.science/schemas/pith-number/v1.json","pith_number":"pith:2022:NAHBDSF5ESBDFMTXODYKKPVBR6","short_pith_number":"pith:NAHBDSF5","schema_version":"1.0","canonical_sha256":"680e11c8bd248232b27770f0a53ea18fbf3cf4a28b5bd89b3d3b568171503a49","source":{"kind":"arxiv","id":"2205.12944","version":4},"attestation_state":"computed","paper":{"title":"Learning in Mean Field Games: A Survey","license":"http://arxiv.org/licenses/nonexclusive-distrib/1.0/","headline":"","cross_cats":["cs.AI","cs.GT","math.OC"],"primary_cat":"cs.LG","authors_text":"Julien P\\'erolat, Mathieu Lauri\\`ere, Matthieu Geist, Olivier Pietquin, Paul Muller, Romuald \\'Elie, Sarah Perrin, Sertan Girgin","submitted_at":"2022-05-25T17:49:37Z","abstract_excerpt":"Non-cooperative and cooperative games with a very large number of players have many applications but remain generally intractable when the number of players increases. Introduced by Lasry and Lions, and Huang, Caines and Malham\\'e, Mean Field Games (MFGs) rely on a mean-field approximation to allow the number of players to grow to infinity. Traditional methods for solving these games generally rely on solving partial or stochastic differential equations with a full knowledge of the model. Recently, Reinforcement Learning (RL) has appeared promising to solve complex problems at scale. The combi"},"verification_status":{"content_addressed":true,"pith_receipt":true,"author_attested":false,"weak_author_claims":0,"strong_author_claims":0,"externally_anchored":false,"storage_verified":false,"citation_signatures":0,"replication_records":0,"graph_snapshot":true,"references_resolved":false,"formal_links_present":false},"canonical_record":{"source":{"id":"2205.12944","kind":"arxiv","version":4},"metadata":{"license":"http://arxiv.org/licenses/nonexclusive-distrib/1.0/","primary_cat":"cs.LG","submitted_at":"2022-05-25T17:49:37Z","cross_cats_sorted":["cs.AI","cs.GT","math.OC"],"title_canon_sha256":"cb56462660694a48048dc6799cad6aaaa1bea24b37ffdd290931b0bd61bd275a","abstract_canon_sha256":"3decabd525fcfb56bd4f200792249180e1a2b522f929a564298927b1c91b34a4"},"schema_version":"1.0"},"receipt":{"kind":"pith_receipt","key_id":"pith-v1-2026-05","algorithm":"ed25519","signed_at":"2026-07-05T08:48:56.779397Z","signature_b64":"b3ldo/VK2C8SvdokgntMhfP3BDKHJprVaUmj3xyJqeHB8UGWohgFc3fr4pQT47eDiXA4vJ3xxiCdmfB5y7DXDA==","signed_message":"canonical_sha256_bytes","builder_version":"pith-number-builder-2026-05-17-v1","receipt_version":"0.3","canonical_sha256":"680e11c8bd248232b27770f0a53ea18fbf3cf4a28b5bd89b3d3b568171503a49","last_reissued_at":"2026-07-05T08:48:56.778915Z","signature_status":"signed_v1","first_computed_at":"2026-07-05T08:48:56.778915Z","public_key_fingerprint":"8d4b5ee74e4693bcd1df2446408b0d54"},"graph_snapshot":{"paper":{"title":"Learning in Mean Field Games: A Survey","license":"http://arxiv.org/licenses/nonexclusive-distrib/1.0/","headline":"","cross_cats":["cs.AI","cs.GT","math.OC"],"primary_cat":"cs.LG","authors_text":"Julien P\\'erolat, Mathieu Lauri\\`ere, Matthieu Geist, Olivier Pietquin, Paul Muller, Romuald \\'Elie, Sarah Perrin, Sertan Girgin","submitted_at":"2022-05-25T17:49:37Z","abstract_excerpt":"Non-cooperative and cooperative games with a very large number of players have many applications but remain generally intractable when the number of players increases. Introduced by Lasry and Lions, and Huang, Caines and Malham\\'e, Mean Field Games (MFGs) rely on a mean-field approximation to allow the number of players to grow to infinity. Traditional methods for solving these games generally rely on solving partial or stochastic differential equations with a full knowledge of the model. Recently, Reinforcement Learning (RL) has appeared promising to solve complex problems at scale. The combi"},"claims":{"count":0,"items":[],"snapshot_sha256":"258153158e38e3291e3d48162225fcdb2d5a3ed65a07baac614ab91432fd4f57"},"source":{"id":"2205.12944","kind":"arxiv","version":4},"verdict":{"id":null,"model_set":{},"created_at":null,"strongest_claim":"","one_line_summary":"","pipeline_version":null,"weakest_assumption":"","pith_extraction_headline":""},"integrity":{"clean":true,"summary":{"advisory":0,"critical":0,"by_detector":{},"informational":0},"endpoint":"/pith/2205.12944/integrity.json","findings":[],"available":true,"detectors_run":[],"snapshot_sha256":"c28c3603d3b5d939e8dc4c7e95fa8dfce3d595e45f758748cecf8e644a296938"},"references":{"count":0,"sample":[],"resolved_work":0,"snapshot_sha256":"258153158e38e3291e3d48162225fcdb2d5a3ed65a07baac614ab91432fd4f57","internal_anchors":0},"formal_canon":{"evidence_count":0,"snapshot_sha256":"258153158e38e3291e3d48162225fcdb2d5a3ed65a07baac614ab91432fd4f57"},"author_claims":{"count":0,"strong_count":0,"snapshot_sha256":"258153158e38e3291e3d48162225fcdb2d5a3ed65a07baac614ab91432fd4f57"},"builder_version":"pith-number-builder-2026-05-17-v1"},"aliases":[{"alias_kind":"arxiv","alias_value":"2205.12944","created_at":"2026-07-05T08:48:56.778978+00:00"},{"alias_kind":"arxiv_version","alias_value":"2205.12944v4","created_at":"2026-07-05T08:48:56.778978+00:00"},{"alias_kind":"doi","alias_value":"10.48550/arxiv.2205.12944","created_at":"2026-07-05T08:48:56.778978+00:00"},{"alias_kind":"pith_short_12","alias_value":"NAHBDSF5ESBD","created_at":"2026-07-05T08:48:56.778978+00:00"},{"alias_kind":"pith_short_16","alias_value":"NAHBDSF5ESBDFMTX","created_at":"2026-07-05T08:48:56.778978+00:00"},{"alias_kind":"pith_short_8","alias_value":"NAHBDSF5","created_at":"2026-07-05T08:48:56.778978+00:00"}],"events":[],"event_summary":{},"paper_claims":[],"inbound_citations":{"count":10,"internal_anchor_count":0,"sample":[{"citing_arxiv_id":"2606.20356","citing_title":"Robust $Q$-learning for mean-field control under Wasserstein uncertainty in common noise","ref_index":50,"is_internal_anchor":false},{"citing_arxiv_id":"2607.01525","citing_title":"Mean Field Reinforcement Learning","ref_index":104,"is_internal_anchor":false},{"citing_arxiv_id":"2605.15285","citing_title":"Universal Approximation of Nonlinear Operators and Their Derivatives","ref_index":78,"is_internal_anchor":false},{"citing_arxiv_id":"2605.15602","citing_title":"Travel-time tomography from mean field game dynamics","ref_index":14,"is_internal_anchor":false},{"citing_arxiv_id":"2504.13228","citing_title":"Neural Mean-Field Games: Extending Mean-Field Game Theory with Neural Stochastic Differential Equations","ref_index":63,"is_internal_anchor":false},{"citing_arxiv_id":"2605.15602","citing_title":"Travel-time tomography from mean field game dynamics","ref_index":13,"is_internal_anchor":false},{"citing_arxiv_id":"2605.15285","citing_title":"Universal Approximation of Nonlinear Operators and Their Derivatives","ref_index":78,"is_internal_anchor":false},{"citing_arxiv_id":"2601.18991","citing_title":"Who Restores the Peg? A Mean-Field Game Approach to Model Stablecoin Market Dynamics","ref_index":7,"is_internal_anchor":false},{"citing_arxiv_id":"2605.11042","citing_title":"Towards Model-Free Learning in Dynamic Population Games: An Application to Karma Economies","ref_index":22,"is_internal_anchor":false},{"citing_arxiv_id":"2604.27378","citing_title":"Continuous-time q-learning for mean-field control with common noise, part-II: q-learning algorithms","ref_index":45,"is_internal_anchor":false}]},"formal_canon":{"evidence_count":0,"sample":[],"anchors":[]},"links":{"html":"https://pith.science/pith/NAHBDSF5ESBDFMTXODYKKPVBR6","json":"https://pith.science/pith/NAHBDSF5ESBDFMTXODYKKPVBR6.json","graph_json":"https://pith.science/api/pith-number/NAHBDSF5ESBDFMTXODYKKPVBR6/graph.json","events_json":"https://pith.science/api/pith-number/NAHBDSF5ESBDFMTXODYKKPVBR6/events.json","paper":"https://pith.science/paper/NAHBDSF5"},"agent_actions":{"view_html":"https://pith.science/pith/NAHBDSF5ESBDFMTXODYKKPVBR6","download_json":"https://pith.science/pith/NAHBDSF5ESBDFMTXODYKKPVBR6.json","view_paper":"https://pith.science/paper/NAHBDSF5","resolve_alias":"https://pith.science/api/pith-number/resolve?arxiv=2205.12944&json=true","fetch_graph":"https://pith.science/api/pith-number/NAHBDSF5ESBDFMTXODYKKPVBR6/graph.json","fetch_events":"https://pith.science/api/pith-number/NAHBDSF5ESBDFMTXODYKKPVBR6/events.json","actions":{"anchor_timestamp":"https://pith.science/pith/NAHBDSF5ESBDFMTXODYKKPVBR6/action/timestamp_anchor","attest_storage":"https://pith.science/pith/NAHBDSF5ESBDFMTXODYKKPVBR6/action/storage_attestation","attest_author":"https://pith.science/pith/NAHBDSF5ESBDFMTXODYKKPVBR6/action/author_attestation","sign_citation":"https://pith.science/pith/NAHBDSF5ESBDFMTXODYKKPVBR6/action/citation_signature","submit_replication":"https://pith.science/pith/NAHBDSF5ESBDFMTXODYKKPVBR6/action/replication_record"}},"created_at":"2026-07-05T08:48:56.778978+00:00","updated_at":"2026-07-05T08:48:56.778978+00:00"}