Disarranged Harmonization of Transparency Reporting by Social Media Platforms Under the Digital Services Act
Pith reviewed 2026-05-19 22:24 UTC · model grok-4.3
The pith
Transparency reports from major social media platforms remain inconsistent and incomplete under the Digital Services Act.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Despite the DSA's push for harmonized transparency reporting, the eight largest EU social media platforms display varying compliance levels with persistent issues in data formatting, timeliness, consistency, and completeness; some platforms use different procedures across mechanisms and submit contrasting information; interoperability between mechanisms stays limited; and many previously noted problems with transparency reporting remain unresolved.
What carries the argument
Structured comparative assessment of key reporting dimensions through large-scale quantitative analyses on data quality and consistency across platforms and reporting mechanisms.
If this is right
- Harmonization under the DSA has not produced consistent reporting across different mechanisms for the same platform.
- Data quality problems continue to obstruct effective auditing of platform transparency.
- Interoperability between reporting mechanisms is still blocked by differing procedures.
- Many pre-DSA issues with transparency reporting have not been fixed by the new rules.
Where Pith is reading between the lines
- Mandating uniform data formats and submission deadlines could reduce the observed inconsistencies.
- Auditors may need to cross-check multiple reporting channels to obtain a reliable picture of platform activity.
- Persistent gaps could limit the ability of researchers and regulators to track changes in content moderation over time.
Load-bearing premise
The transparency reports submitted by platforms are sufficiently complete and accessible to allow direct quantitative comparison of data quality and consistency across different reporting mechanisms without significant missing context or selection effects.
What would settle it
Locating one platform that submits identical, timely, complete, and consistently formatted data through every required DSA reporting mechanism would undermine the claim of widespread unresolved issues.
Figures
read the original abstract
The European Commission recently introduced new regulation to harmonize transparency reporting of large online platforms under the Digital Services Act (DSA). Here, we present the first systematic evaluation of transparency reporting data quality after this normative change, for the eight largest social media platforms in the European Union. In detail, we run a set of large-scale quantitative analyses on key reporting dimensions, followed by a structured comparative assessment across platforms and reporting mechanisms. Among our findings is that: (i) the analyzed platforms had varying degrees of compliance and data quality, but all exhibited issues on data formatting, timeliness, consistency, and completeness; (ii) some platforms employed differing reporting procedures across mechanisms, which caused them to submit contrasting information; (iii) despite the harmonization, a number of issues still prevent interoperability between reporting mechanisms; and (iv) many of the previously identified issues with transparency reporting are still unresolved. We conclude by discussing implications for transparency auditing and proposing key targeted improvements to strengthen the reliability and interoperability of DSA transparency reporting.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript presents the first systematic evaluation of transparency reporting data quality for the eight largest social media platforms in the EU under the Digital Services Act (DSA). Through large-scale quantitative analyses on key reporting dimensions and a structured comparative assessment across platforms and reporting mechanisms, it finds that all platforms exhibit issues with data formatting, timeliness, consistency, and completeness; some platforms employ differing procedures leading to contrasting information; interoperability between mechanisms remains limited; and many previously identified transparency issues persist. The paper concludes with implications for auditing and targeted improvement proposals.
Significance. If the empirical findings hold, this work provides timely evidence on the practical outcomes of DSA harmonization efforts, highlighting persistent gaps in transparency reporting that could inform regulatory refinements and auditing standards. The comparative assessment across eight platforms and multiple mechanisms is a clear strength, offering a broad view of compliance variation that prior studies have not systematically addressed at this scale.
major comments (3)
- [Methods] Methods section: The retrieval protocol, inclusion criteria, and handling of missing entries or inaccessible historical versions for the DSA transparency reports are not detailed. This is load-bearing for the central claims, as the findings that all platforms exhibited issues on formatting, timeliness, consistency, and completeness, plus contrasting information from differing procedures, assume the collected reports form a representative sample without selection effects from platform-specific publication practices.
- [Results] Results section (quantitative analyses): Specifics on dataset sizes, error handling, statistical methods, or metrics for assessing data quality dimensions are absent. This undermines evaluation of the robustness of the claim that all eight platforms exhibited issues, particularly given the abstract's emphasis on large-scale analyses.
- [Comparative assessment] Comparative assessment: The assertion that differing reporting procedures caused platforms to submit contrasting information requires concrete examples linked to specific data points, tables, or figures to demonstrate the scale and implications for interoperability.
minor comments (2)
- [Abstract] Abstract: Consider briefly specifying one or two example metrics or dimensions used in the quantitative analyses to give readers a clearer sense of the evaluation scope.
- Notation and terminology: Ensure consistent use of terms like 'reporting mechanisms' and 'transparency reports' throughout to avoid minor ambiguity in cross-platform comparisons.
Simulated Author's Rebuttal
We thank the referee for their constructive and detailed feedback, which helps clarify key aspects of our methodology and presentation of results. We address each major comment below, indicating revisions where we agree additional detail or examples will strengthen the manuscript.
read point-by-point responses
-
Referee: [Methods] Methods section: The retrieval protocol, inclusion criteria, and handling of missing entries or inaccessible historical versions for the DSA transparency reports are not detailed. This is load-bearing for the central claims, as the findings that all platforms exhibited issues on formatting, timeliness, consistency, and completeness, plus contrasting information from differing procedures, assume the collected reports form a representative sample without selection effects from platform-specific publication practices.
Authors: We agree that greater detail on data collection is needed to support the representativeness of our sample and address potential selection effects. The current manuscript outlines the overall approach at a high level but does not fully specify the retrieval protocol, inclusion criteria, or procedures for missing or historical versions. In the revised manuscript, we will expand the Methods section with a dedicated subsection describing the data sources (official DSA repositories and platform disclosures), the time window and search strategy used, explicit inclusion/exclusion criteria, and how inaccessible or missing reports were handled, including any platform-specific publication variations encountered. revision: yes
-
Referee: [Results] Results section (quantitative analyses): Specifics on dataset sizes, error handling, statistical methods, or metrics for assessing data quality dimensions are absent. This undermines evaluation of the robustness of the claim that all eight platforms exhibited issues, particularly given the abstract's emphasis on large-scale analyses.
Authors: We acknowledge that the Results section would benefit from explicit reporting of dataset characteristics and analytical procedures to allow readers to assess robustness. While the manuscript presents the outcomes of the large-scale quantitative analyses on the key dimensions, it does not currently include dataset sizes, error-handling steps, or the precise metrics and methods applied. In the revision, we will add these specifics, including the number of reports analyzed per platform and mechanism, how inconsistencies or errors were identified and coded, the quantitative metrics used for each quality dimension (e.g., formatting compliance rates, timeliness thresholds), and any descriptive or comparative statistics employed. revision: yes
-
Referee: [Comparative assessment] Comparative assessment: The assertion that differing reporting procedures caused platforms to submit contrasting information requires concrete examples linked to specific data points, tables, or figures to demonstrate the scale and implications for interoperability.
Authors: We agree that linking the observed differences in reporting procedures to concrete examples would improve clarity and demonstrate the practical implications. The comparative assessment section already identifies instances where platforms used differing procedures across mechanisms leading to contrasting information, but these could be more explicitly tied to underlying data. In the revised manuscript, we will add specific examples drawn from the collected reports, cross-referenced to particular data points, and include or expand a table or figure that illustrates selected cases of inconsistency and their effects on interoperability. revision: yes
Circularity Check
No circularity: empirical analysis of public DSA transparency reports relies on direct data comparison without derivations or self-referential reductions
full rationale
The paper presents a systematic evaluation of transparency reports submitted by eight social media platforms under the DSA. It describes running large-scale quantitative analyses on dimensions such as formatting, timeliness, consistency, and completeness, followed by comparative assessment. No equations, fitted parameters, predictions, or first-principles derivations are claimed or present in the abstract or described methodology. The central findings rest on direct inspection and comparison of publicly accessible reports rather than any internal construction where outputs reduce to inputs by definition or self-citation chains. This is a standard empirical study of external data sources, self-contained against public benchmarks, with no load-bearing steps that exhibit the enumerated circularity patterns.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption The eight largest social media platforms in the EU are representative for assessing overall DSA transparency reporting compliance.
Lean theorems connected to this paper
-
IndisputableMonolith/Foundation/RealityFromDistinction.leanreality_from_one_distinction unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
All eight platforms exhibited issues on data formatting, timeliness, consistency, and completeness
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
quantitative analyses on key reporting dimensions
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
PeerJ Computer Science , volume=
How to detect propaganda from social media? Exploitation of semantic and fine-tuned language models , author=. PeerJ Computer Science , volume=. 2023 , publisher=
work page 2023
-
[2]
Advances in machine learning algorithms for hate speech detection in social media: A review , author=. IEEE Access , volume=. 2021 , publisher=
work page 2021
-
[3]
ACM Computing Surveys , volume=
Multi-modal misinformation detection: Approaches, challenges and opportunities , author=. ACM Computing Surveys , volume=. 2024 , publisher=
work page 2024
-
[4]
Journal of Computer and Communications , volume=
Deepfakes detection techniques using deep learning: A survey , author=. Journal of Computer and Communications , volume=
-
[5]
Proceedings of the 2021 CHI conference on human factors in computing systems , pages=
The psychological well-being of content moderators: the emotional labor of commercial moderation and avenues for improving support , author=. Proceedings of the 2021 CHI conference on human factors in computing systems , pages=
work page 2021
-
[6]
Proceedings of the ACM on Human-Computer Interaction , volume=
Does transparency in moderation really matter? User behavior after content removal explanations on reddit , author=. Proceedings of the ACM on Human-Computer Interaction , volume=. 2019 , publisher=
work page 2019
-
[7]
Automated Transparency: A legal and empirical analysis of the
Kaushal, Rishabh and Van De Kerkhof, Jacob and Goanta, Catalina and Spanakis, Gerasimos and Iamnitchi, Adriana , booktitle=. Automated Transparency: A legal and empirical analysis of the
- [8]
-
[9]
Proceedings of the ACM on Human-Computer Interaction , volume=
Disproportionate removals and differing content moderation experiences for conservative, transgender, and black social media users: Marginalization and moderation gray areas , author=. Proceedings of the ACM on Human-Computer Interaction , volume=. 2021 , publisher=
work page 2021
-
[10]
Proceedings of the ACM on human-computer interaction , volume=
Contestability for content moderation , author=. Proceedings of the ACM on human-computer interaction , volume=. 2021 , publisher=
work page 2021
-
[11]
The Palgrave Handbook of Global Social Problems , pages=
Impact of Social Media Among Vulnerable Sections of Society and the Construction of Social Problems , author=. The Palgrave Handbook of Global Social Problems , pages=. 2023 , publisher=
work page 2023
-
[12]
Engineering Applications of Artificial Intelligence , volume=
Identification of cyber harassment and intention of target users on social media platforms , author=. Engineering Applications of Artificial Intelligence , volume=. 2022 , publisher=
work page 2022
-
[13]
Trujillo, Amaury and Fagni, Tiziano and Cresci, Stefano , booktitle=
-
[14]
Dergacheva, Daria and Kuznetsova, Vasilisa and Scharlach, Rebecca and Katzenbach, Christian , year=. One day in content moderation: Analyzing 24h of social media platforms’ content decisions through the
-
[15]
Content moderation on social media in the
Drolsbach, Chiara Patricia and Pr. Content moderation on social media in the. ACM WebConf Companion , year=
-
[16]
Pornographic content classification using deep-learning , year =
Tabone, Andr\'. Pornographic content classification using deep-learning , year =
-
[17]
How transparent are transparency reports?
Urman, Aleksandra and Makhortykh, Mykola , journal=. How transparent are transparency reports?. 2023 , publisher=
work page 2023
-
[18]
Enabling research with publicly accessible platform data: Early
Jaursch, Julian and Ohme, Jakob and Klinger, Ulrike , year=. Enabling research with publicly accessible platform data: Early
-
[19]
Regulation on a Single Market For Digital Services (Digital Services Act) and amending Directive , year = 2022, note =
work page 2022
-
[20]
Shahi, Gautam Kishore and Tessa, Benedetta and Trujillo, Amaury and Cresci, Stefano , booktitle=
-
[21]
Van de Kerkhof, J , journal=
-
[22]
Foundation model transparency reports , author=. AAAI/ACM AIES , year=
-
[23]
Li, Huaxia and Gao, Haoyun and Wu, Chengzhang and Vasarhelyi, Miklos A , journal=. 2024 , publisher=
work page 2024
-
[24]
Can large language models replace humans in systematic reviews?
Khraisha, Qusai and Put, Sophie and Kappenberg, Johanna and Warraitch, Azza and Hadfield, Kristin , journal=. Can large language models replace humans in systematic reviews?. 2024 , publisher=
work page 2024
- [25]
-
[26]
Content moderation and platform observability in the
Papaevangelou, Charis and Votta, Fabio , year=. Content moderation and platform observability in the
-
[27]
Cima, Lorenzo and Miaschi, Alessio and Trujillo, Amaury and Avvenuti, Marco and Dell'Orletta, Felice and Cresci, Stefano , booktitle =
-
[28]
LLM s to the Rescue: Explaining DSA Statements of Reason with Platform's Terms of Services
Aspromonte, Marco and Ferraris, Andrea and Galli, Federico and Contissa, Giuseppe. LLM s to the Rescue: Explaining DSA Statements of Reason with Platform's Terms of Services. NLLP. 2024
work page 2024
-
[29]
E er, Leonard and Spanakis, Gerasimos. Linking Transparency and Accountability: Analysing The Connection Between T ik T ok ' s Terms of Service and Moderation Decisions. NLLP. 2025
work page 2025
-
[30]
Improving regulatory oversight in online content moderation , author=. ECML-PKDD Workshops , year=
-
[31]
Telecommunications Policy , year=
Big data, small answers: How the DSA Transparency Database falls short of its regulatory objectives , author=. Telecommunications Policy , year=
- [32]
-
[33]
Delayed takedown of illegal content on social media makes moderation ineffective , author=. arXiv:2502.08841 , year=
-
[34]
It is unfair, and it would be unwise to expect the user to know the law!
“It is unfair, and it would be unwise to expect the user to know the law!”--Evaluating reporting mechanisms under the Digital Services Act , author=. ACM FAccT '25 , year =
-
[35]
Platforms under the Digital Services Act , author=
The Great Data Standoff: Researchers vs. Platforms under the Digital Services Act , author=. AAAI ICWSM , year =
-
[36]
``There is literally zero funding''': Understanding the Emerging Role of Trusted Flaggers under the EU Digital Services Act , author=. arXiv:2603.29874 , year=
-
[37]
When Transparency Falls Short: Auditing Platform Moderation During a High-Stakes Election
When Transparency Falls Short: Auditing Platform Moderation During a High-Stakes Election , author=. arXiv:2604.19285 , year=
work page internal anchor Pith review Pith/arXiv arXiv
-
[38]
Santa Clara principles on transparency and accountability in content moderation , howpublished =. 2021 , note =
work page 2021
-
[39]
Beyond phase-in: assessing impacts on disinformation of the EU Digital Services Act , author=. AI and Ethics , volume=. 2025 , publisher=
work page 2025
-
[40]
Social Responsibility Journal , volume=
Transparency reports as CSR reports: motives, stakeholders, and strategies , author=. Social Responsibility Journal , volume=. 2024 , publisher=
work page 2024
-
[41]
Annual Review of Law and Social Science , volume=
Regulating Content Moderation for Democracy: A Transatlantic Divide , author=. Annual Review of Law and Social Science , volume=. 2025 , publisher=
work page 2025
-
[42]
Jaursch, Julian and Ohme, Jakob and Klinger, Ulrike , title=
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.