
arxiv: 2507.14808 · v3 · submitted 2025-07-20 · 💱 q-fin.CP · cs.CE · cs.LG

Decoding RWA Tokenized U.S. Treasuries: Functional Dissection and Address Role Inference

Pith reviewed 2026-05-19 04:45 UTC · model grok-4.3

classification 💱 q-fin.CP · cs.CE · cs.LG
keywords RWA tokenized treasuries · smart contract decoding · address role inference · representation learning · blockchain transaction analysis · economic role classification · multi-chain RWA

The pith

A curvature-aware model infers economic roles of addresses from decoded transactions in tokenized U.S. Treasuries.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper decodes smart contract calls for tokens such as BUIDL, BENJI, and USDY across Ethereum and layer-2 chains to expose core operations including issuance, redemption, transfer, and bridging. These patterns highlight differences in how large institutional participants interact compared with smaller or retail users. The authors introduce a curvature-aware representation learning model that classifies addresses into roles such as institutional treasuries, arbitrage bots, and retail traders based on behavioral patterns. This model outperforms baselines on their collected dataset and generalizes to broader public blockchain transaction data.
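To make "decoding contract calls" concrete, here is a minimal sketch of how raw calldata can be mapped to a financial primitive. It assumes the standard ABI layout (a 4-byte function selector followed by 32-byte argument words). The selectors listed are the standard values for those exact signatures, but whether BUIDL, BENJI, or USDY expose functions with these names is an assumption, as are the helper names `decode_call` and `decode_transfer`. Bridging is omitted because bridge entry points differ per protocol.

```python
# Illustrative selector table. The 4-byte values are the standard selectors for
# these exact signatures; mapping them to the paper's primitives is an
# assumption made for this sketch, not something the summary states.
SELECTOR_TO_PRIMITIVE = {
    "a9059cbb": ("transfer(address,uint256)", "transfer"),
    "23b872dd": ("transferFrom(address,address,uint256)", "transfer"),
    "40c10f19": ("mint(address,uint256)", "issuance"),
    "42966c68": ("burn(uint256)", "redemption"),
}


def decode_call(calldata_hex: str) -> dict:
    """Split raw calldata into selector, primitive label, and 32-byte words."""
    data = calldata_hex.lower().removeprefix("0x")
    selector, body = data[:8], data[8:]
    signature, primitive = SELECTOR_TO_PRIMITIVE.get(selector, ("unknown", "other"))
    # ABI encoding packs each static argument into one 32-byte (64 hex char) word.
    words = [body[i:i + 64] for i in range(0, len(body), 64)]
    return {"selector": selector, "signature": signature,
            "primitive": primitive, "words": words}


def decode_transfer(calldata_hex: str) -> tuple[str, int]:
    """Decode transfer(address,uint256) into (recipient, raw token amount)."""
    call = decode_call(calldata_hex)
    if call["signature"] != "transfer(address,uint256)" or len(call["words"]) != 2:
        raise ValueError("not a well-formed transfer(address,uint256) call")
    recipient = "0x" + call["words"][0][-40:]  # an address is the low 20 bytes
    amount = int(call["words"][1], 16)         # raw units, before decimals scaling
    return recipient, amount


# Synthetic calldata built for the example (not a real transaction).
example = "0xa9059cbb" + "0" * 24 + "11" * 20 + format(1000, "064x")
recipient, amount = decode_transfer(example)
```

Counting the `primitive` labels per sender address is one simple way to turn decoded calls into the behavioral features the summary refers to.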

Core claim

Decoded contract calls in tokenized U.S. Treasury RWAs reveal financial primitives that distinguish participant types, and a curvature-aware representation learning model infers address-level economic roles from these behavioral patterns, outperforming baseline models on the U.S. Treasury transaction dataset while generalizing to address classification across other public blockchain datasets.

What carries the argument

Curvature-aware representation learning model that embeds decoded contract call behaviors to classify addresses by economic role.
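The summary does not say how curvature enters the model. Purely as an illustration of what a curvature feature on a transaction graph can look like, the sketch below assumes the simplest combinatorial Forman-Ricci curvature for an unweighted, undirected edge, F(u, v) = 4 - deg(u) - deg(v), and averages it over each address's incident edges. The choice of Forman curvature, the function names, and the toy graph are all assumptions, not the authors' method.

```python
from collections import defaultdict


def forman_curvature(edges):
    """Return {(u, v): 4 - deg(u) - deg(v)} for an unweighted, undirected graph."""
    neighbors = defaultdict(set)
    for u, v in edges:
        if u != v:  # ignore self-transfers
            neighbors[u].add(v)
            neighbors[v].add(u)
    curvature = {}
    for u in neighbors:
        for v in neighbors[u]:
            if u < v:  # visit each undirected edge once
                curvature[(u, v)] = 4 - len(neighbors[u]) - len(neighbors[v])
    return curvature


def mean_curvature_per_address(edges):
    """Average the curvature of the edges incident to each address."""
    totals, counts = defaultdict(float), defaultdict(int)
    for (u, v), value in forman_curvature(edges).items():
        for node in (u, v):
            totals[node] += value
            counts[node] += 1
    return {node: totals[node] / counts[node] for node in totals}


# Hypothetical toy graph: one hub ("issuer") and three counterparties.
toy_edges = [("issuer", "a"), ("issuer", "b"), ("issuer", "c"), ("a", "b")]
edge_curvature = forman_curvature(toy_edges)
address_feature = mean_curvature_per_address(toy_edges)
```

Under this definition, edges that touch high-degree hubs receive more negative values, so a per-address average gives a rough structural signal that could be fed to a classifier alongside call-frequency features.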

If this is right

  • Reveals the current extent and limits of retail participation in RWA adoption.
  • Enables distinction between institutional treasuries, arbitrage bots, and retail traders.
  • Supports more transparent, inclusive, and accountable Web3 finance through better participant insight.
  • Extends role inference techniques to address classification tasks on other blockchain datasets.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same decoding and embedding approach could extend to role analysis in other tokenized real-world assets.
  • Regulators might use similar models to monitor concentration or unusual activity in growing RWA markets.
  • Future work could combine the method with partial known-entity data to create hybrid supervised-unsupervised validation.

Load-bearing premise

Behavioral patterns extracted from decoded contract calls reliably indicate distinct economic roles without external ground-truth labels or validation against known entities.

What would settle it

The claim would be undermined if the model's role classifications showed no significant accuracy gain over baselines when tested against a dataset with partial off-chain entity labels for the same addresses.
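One way such a check could be run is a paired bootstrap over the externally labeled addresses. The sketch below is an assumption about how to operationalize "significant accuracy gain"; the function names, the 95% percentile interval, and the toy labels are illustrative and do not come from the paper.

```python
import random


def accuracy(predictions, labels):
    """Fraction of addresses whose predicted role matches the external label."""
    return sum(p == y for p, y in zip(predictions, labels)) / len(labels)


def paired_bootstrap_gain(model_preds, baseline_preds, labels,
                          n_resamples=2000, seed=0):
    """Bootstrap the accuracy gain (model minus baseline) on labeled addresses.

    Returns (observed_gain, lower, upper), where lower and upper bound a 95%
    percentile interval. Both systems are scored on the same resampled
    addresses, which keeps the comparison paired.
    """
    n = len(labels)
    rng = random.Random(seed)
    observed = accuracy(model_preds, labels) - accuracy(baseline_preds, labels)
    gains = []
    for _ in range(n_resamples):
        idx = [rng.randrange(n) for _ in range(n)]
        model_acc = sum(model_preds[i] == labels[i] for i in idx) / n
        baseline_acc = sum(baseline_preds[i] == labels[i] for i in idx) / n
        gains.append(model_acc - baseline_acc)
    gains.sort()
    lower = gains[int(0.025 * n_resamples)]
    upper = gains[int(0.975 * n_resamples) - 1]
    return observed, lower, upper


# Hypothetical labels for illustration only.
toy_labels = ["institution", "bot", "retail", "retail", "bot", "institution"]
no_gain = paired_bootstrap_gain(toy_labels, toy_labels, toy_labels)
```

An interval that includes zero on a real labeled subset would be the outcome described above; an interval strictly above zero would support the paper's claim.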

Original abstract

Tokenized U.S. Treasuries have emerged as a prominent subclass of real-world assets (RWAs), offering cryptographically secured, yield-bearing instruments issued across multi-chain Web3 infrastructures, with growing significance for transparency, accessibility, and financial inclusion. While the market has expanded rapidly, empirical analyses of transaction-level behaviours remain limited. This paper conducts a quantitative, function-level dissection of U.S. Treasury-backed RWA tokens, including BUIDL, BENJI, and USDY across multi-chain: mostly Ethereum and Layer-2s. Decoded contract calls expose core financial primitives such as issuance, redemption, transfer, and bridging, revealing patterns that distinguish institutional participants from smaller or retail users for the extent and limits of inclusivity in current RWA adoption. To infer address-level economic roles, we introduce a curvature-aware representation learning model. Our method outperforms baseline models in role inference on our collected U.S. Treasury transaction dataset and generalizes to address classification across broader public blockchain transaction datasets. The decoded transaction-level patterns in tokenized U.S. Treasuries across chains surface the degree of retail participation, and the role inference model enables the distinction between institutional treasuries, arbitrage bots, and retail traders based on behavioral patterns, facilitating future more transparent, inclusive, and accountable Web3 finance.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The paper dissects tokenized U.S. Treasuries (BUIDL, BENJI, USDY) across Ethereum and Layer-2 chains by decoding contract calls for primitives including issuance, redemption, transfer, and bridging. It introduces a curvature-aware representation learning model to infer address-level economic roles (institutional treasuries, arbitrage bots, retail traders) from behavioral patterns extracted from these calls, claiming outperformance over baselines on the authors' collected Treasury RWA dataset and generalization to broader public blockchain transaction datasets.

Significance. If the role-inference results hold under independent validation, the work would supply rare transaction-level empirical evidence on RWA participant behavior and a new representation-learning approach for on-chain address classification. These elements could support downstream regulatory monitoring and market-structure studies in tokenized fixed-income markets.

major comments (2)
  1. [Abstract] The central claim that the curvature-aware model 'outperforms baseline models in role inference' and 'generalizes' is stated without any accuracy, F1, or other quantitative metrics, without dataset cardinality, without baseline definitions, and without a validation procedure; this absence is load-bearing because the entire contribution rests on the empirical superiority assertion.
  2. [Role-inference section] In the model description and evaluation, behavioral patterns from decoded calls are used to assign roles (institutional vs. arbitrage vs. retail) without any external ground-truth labels, known-entity registries, or cross-validation against exchange or institutional address lists; if the labels are derived from the same heuristics or clustering that the model exploits, the reported outperformance reduces to internal consistency rather than genuine role discovery.
minor comments (2)
  1. [Model description] Notation for the curvature-aware embedding is introduced without an explicit equation or pseudocode; a compact mathematical definition would improve reproducibility.
  2. [Data section] The multi-chain dataset construction (which chains, which token contracts, time window) is described only at high level; a table summarizing transaction counts per token and per chain would aid readers.
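The per-token, per-chain table requested in the second minor comment is cheap to produce once transactions are decoded. A minimal sketch, assuming each decoded transaction is a dict with hypothetical `token` and `chain` keys; the sample rows are made-up placeholders, not figures from the paper.

```python
from collections import Counter


def count_by_token_and_chain(records):
    """Count transactions for each (token, chain) pair."""
    return Counter((r["token"], r["chain"]) for r in records)


def format_table(counts):
    """Render counts as plain text with tokens as rows and chains as columns."""
    tokens = sorted({token for token, _ in counts})
    chains = sorted({chain for _, chain in counts})
    header = ["token"] + chains
    rows = [[t] + [str(counts.get((t, c), 0)) for c in chains] for t in tokens]
    table = [header] + rows
    widths = [max(len(row[i]) for row in table) for i in range(len(header))]
    lines = ["  ".join(cell.ljust(w) for cell, w in zip(row, widths)).rstrip()
             for row in table]
    return "\n".join(lines)


# Placeholder rows for illustration only.
sample_records = [
    {"token": "BUIDL", "chain": "ethereum"},
    {"token": "BUIDL", "chain": "ethereum"},
    {"token": "USDY", "chain": "ethereum"},
    {"token": "USDY", "chain": "layer2"},
]
sample_counts = count_by_token_and_chain(sample_records)
sample_table = format_table(sample_counts)
```

Adding the observation window (first and last block timestamp per pair) to the same grouping would cover the rest of the referee's request.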

Simulated Author's Rebuttal

2 responses · 0 unresolved

We are grateful to the referee for providing a thorough review of our manuscript on decoding RWA Tokenized U.S. Treasuries. Below, we respond to each major comment and outline the revisions planned for the next version of the paper.

Point-by-point responses
  1. Referee: [Abstract] The central claim that the curvature-aware model 'outperforms baseline models in role inference' and 'generalizes' is stated without any accuracy, F1, or other quantitative metrics, without dataset cardinality, without baseline definitions, and without a validation procedure; this absence is load-bearing because the entire contribution rests on the empirical superiority assertion.

    Authors: We agree that the abstract would benefit from including quantitative details to support our claims. In the revised version of the manuscript, we will update the abstract to report specific performance metrics such as accuracy and F1 scores achieved by our curvature-aware model compared to baselines, the cardinality of the dataset (number of transactions and unique addresses), definitions of the baseline models, and a summary of the validation procedure used. This will provide readers with a clearer understanding of the empirical results without needing to refer to the main text.

    revision: yes

  2. Referee: [Role-inference section] In the model description and evaluation, behavioral patterns from decoded calls are used to assign roles (institutional vs. arbitrage vs. retail) without any external ground-truth labels, known-entity registries, or cross-validation against exchange or institutional address lists; if the labels are derived from the same heuristics or clustering that the model exploits, the reported outperformance reduces to internal consistency rather than genuine role discovery.

    Authors: We appreciate this important point regarding the validation of role assignments. The roles in our study are inferred from behavioral patterns derived from the decoded contract calls, using a curvature-aware representation learning approach that captures transaction curvatures and embeddings. To address concerns about label derivation, we will revise the role-inference section to detail the label assignment process, including any use of heuristic rules for initial labeling and how the model is trained and evaluated to avoid circularity. We will also include results from generalization experiments on public blockchain datasets, cross-referencing external address classifications where they are available. We will explicitly note the limitations regarding ground-truth availability in this domain.

    revision: partial

Circularity Check

0 steps flagged

No significant circularity detected

Full rationale

The paper introduces a curvature-aware representation learning model for inferring economic roles from decoded contract calls in tokenized U.S. Treasury transactions. No equations, fitted parameters, or self-citations are exhibited that reduce any claimed prediction or result to an input by construction. The model is presented as an independent methodological contribution, with performance evaluated against baselines on a collected dataset and generalization tested on broader blockchain data. The derivation chain remains self-contained against external benchmarks, with no load-bearing steps that equate outputs to inputs via definition, renaming, or unverified self-citation chains. Absence of external ground-truth labels raises questions of empirical validity but does not constitute circularity under the specified criteria.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axiom · 1 invented entity

Review performed on abstract only; full methods, data, and model specification unavailable. Free parameters and detailed axioms cannot be enumerated.

axioms (1)
  • [domain assumption] Decoded contract calls and transaction sequences reflect distinct economic participant types
    Central to the role-inference task described in the abstract.
invented entities (1)
  • curvature-aware representation learning model (no independent evidence)
    purpose: Infer address-level economic roles from transaction behavior
    New technique introduced for the role classification task

pith-pipeline@v0.9.0 · 5777 in / 1271 out tokens · 55481 ms · 2026-05-19T04:45:10.082076+00:00 · methodology



Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. SoK of RWA Tokenization: A Systematization of Concepts, Architectures, and Legal Interoperability

    q-fin.GN 2026-04 unverdicted novelty 5.0

    RWA tokenization converts passive assets into programmable economic agents but requires resolving oracle problems and jurisdictional gaps, acting as a transitional bridge rather than an inevitable endpoint toward unif...
