On the Rate-Distortion-Complexity Tradeoff for Semantic Communication

Guangming Shi; Jingxuan Chai; Yong Xiao

arxiv: 2602.14481 · v2 · submitted 2026-02-16 · 💻 cs.IT · cs.AI· math.IT

On the Rate-Distortion-Complexity Tradeoff for Semantic Communication

Jingxuan Chai , Yong Xiao , Guangming Shi This is my paper

Pith reviewed 2026-05-15 22:05 UTC · model grok-4.3

classification 💻 cs.IT cs.AImath.IT

keywords semantic communicationrate-distortion-complexity tradeoffinformation bottleneckminimum description lengthGaussian sourcesbinary sourcessemantic distancemodel complexity

0 comments

The pith

Semantic communication exhibits a three-way tradeoff among rate, semantic distance, and model complexity.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper extends classical rate-distortion theory into a rate-distortion-complexity framework that adds constraints on semantic distance and on model complexity. Closed-form expressions are derived for the minimum rate needed to meet given semantic-distance and complexity targets, first for Gaussian sources and then for binary sources. These expressions show that the three quantities cannot be improved independently: lower complexity forces either a higher rate or greater semantic distance. The tradeoff is validated through experiments on image and video data, where the information-theoretic complexity measure tracks actual training and inference costs.

Core claim

Incorporating semantic distance (via both bit-wise distortion and statistical divergence) and complexity (via minimum description length and information bottleneck) into rate-distortion analysis produces closed-form minimum-rate expressions for Gaussian and binary semantic sources under joint constraints on distance and complexity.

What carries the argument

The rate-distortion-complexity framework, which augments rate-distortion analysis with semantic-distance metrics and an MDL/IB complexity measure.

Load-bearing premise

The complexity measure drawn from minimum description length and information bottleneck theory accurately reflects the practical computational cost of training and running the deep-learning semantic encoders and decoders.

What would settle it

An experiment that trains semantic encoders and decoders of measured complexity, records the rates and semantic distances they actually achieve, and checks whether any point falls below the derived rate bound for its semantic distance and complexity.

Figures

Figures reproduced from arXiv: 2602.14481 by Guangming Shi, Jingxuan Chai, Yong Xiao.

**Figure 2.** Figure 2: Curve plots of the RDC functions for Gaussian semantic sources [PITH_FULL_IMAGE:figures/full_fig_p005_2.png] view at source ↗

**Figure 4.** Figure 4: Curve plots of the RDC functions for Gaussian semantic sources [PITH_FULL_IMAGE:figures/full_fig_p006_4.png] view at source ↗

**Figure 5.** Figure 5: Curve plots of binary RDC tradeoffs: (a) Rate-complexity tradeoff; [PITH_FULL_IMAGE:figures/full_fig_p007_5.png] view at source ↗

**Figure 7.** Figure 7: RDC curves of the image classification task: (a) Rate-complexity [PITH_FULL_IMAGE:figures/full_fig_p008_7.png] view at source ↗

**Figure 8.** Figure 8: The RDC tradeoff curves. objective function for video compression: Lvc := I(U; Sˆ) + λdE[d(S, Sˆ)] + λcI(X;U), (27) where λd and λc denote the tuning parameters that control the RDC tradeoff. Based on (20), the expected loss function Lvc is then lower-bounded by L˜ vcE:=pUSˆ log p(ˆs|u) r(ˆs) + λdE[d(S, Sˆ)] + λcEpXU log p(u|x) t(u) (28) where tU is a given probability distribution. VI. EXPERIMENTAL RESULT… view at source ↗

**Figure 9.** Figure 9: (a) The classification accuracy of the proposed VRDC method under [PITH_FULL_IMAGE:figures/full_fig_p009_9.png] view at source ↗

**Figure 11.** Figure 11: Visual results of the reconstructed image samples across different [PITH_FULL_IMAGE:figures/full_fig_p009_11.png] view at source ↗

**Figure 12.** Figure 12: Curve plots of the reconstruction performance across different [PITH_FULL_IMAGE:figures/full_fig_p010_12.png] view at source ↗

**Figure 13.** Figure 13: Curve plots of the relationship between computation cost (FLOPs) [PITH_FULL_IMAGE:figures/full_fig_p011_13.png] view at source ↗

read the original abstract

Semantic communication is a novel communication paradigm that focuses on conveying the user's intended meaning rather than the bit-wise transmission of source signals. One of the key challenges is to effectively represent and extract the semantic meaning of any given source signals. While deep learning (DL)-based solutions have shown promising results in extracting implicit semantic information from a wide range of sources, existing work often overlooks the high computational complexity inherent in both model training and inference for the DL-based encoder and decoder. To bridge this gap, this paper proposes a rate-distortion-complexity (RDC) framework which extends the classical rate-distortion theory by incorporating the constraints on semantic distance, including both the traditional bit-wise distortion metric and statistical difference-based divergence metric, and complexity measure, adopted from the theory of minimum description length and information bottleneck. We derive the closed-form theoretical results of the minimum achievable rate under given constraints on semantic distance and complexity for both Gaussian and binary semantic sources. Our theoretical results show a fundamental three-way tradeoff among achievable rate, semantic distance, and model complexity. Extensive experiments on real-world image and video datasets validate this tradeoff and further demonstrate that our information-theoretic complexity measure effectively correlates with practical computational costs, guiding efficient system design in resource-constrained scenarios.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

Closed-form minimum-rate expressions for Gaussian and binary sources under joint semantic-distance and MDL/IB complexity constraints are the concrete new piece, but the link from that complexity measure to actual DL compute costs stays thin.

read the letter

The paper works out closed-form minimum rates for Gaussian and binary semantic sources when you constrain both semantic distance (bitwise distortion plus a divergence term) and a complexity penalty taken from minimum description length and information bottleneck theory. That joint formulation and the resulting three-way tradeoff curves are the part that is not already in the classical rate-distortion literature they cite. They also run the expressions on image and video datasets and claim the information-theoretic complexity tracks practical costs well enough to guide design.

Referee Report

2 major / 2 minor

Summary. The paper proposes a rate-distortion-complexity (RDC) framework extending classical rate-distortion theory to semantic communication by incorporating semantic distance constraints (bit-wise distortion and statistical divergence metrics) and a complexity measure drawn from minimum description length and information bottleneck principles. It derives closed-form minimum achievable rates for Gaussian and binary semantic sources under joint constraints on semantic distance and complexity, establishing a fundamental three-way tradeoff among rate, semantic distance, and model complexity. Experiments on real-world image and video datasets are reported to validate the tradeoff and to show that the information-theoretic complexity measure correlates with practical computational costs.

Significance. If the derivations are rigorous and the complexity measure is shown to be practically relevant, the work would be significant for providing an information-theoretic foundation to guide resource-efficient design of deep-learning-based semantic encoders and decoders in constrained environments, moving beyond pure rate-distortion analysis.

major comments (2)

[Theoretical Results] Theoretical Results section: the closed-form minimum-rate expressions for Gaussian and binary sources are presented without derivation steps, explicit definitions of the semantic distance function, or the precise optimization problem being solved; this prevents verification that the expressions are independent rather than reparameterizations of the input MDL/IB definitions.
[Experiments] Complexity measure and experimental validation: the claim that the adopted MDL/IB complexity correlates with practical DL costs (FLOPs, memory, training time) is load-bearing for the practical relevance of the RDC tradeoff, yet the manuscript provides no quantitative correlation coefficients, sensitivity analysis to alternative complexity definitions, or error bars on the dataset results.

minor comments (2)

[Abstract] Abstract and introduction: clarify whether the semantic distance combines bit-wise and divergence metrics additively or via a weighted sum, and state the weighting parameters explicitly.
[Notation] Notation: ensure the complexity scaling constants and semantic distance weighting parameters are defined once and used consistently in all equations and figures.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the detailed and constructive feedback on our manuscript. We address each major comment below and outline the revisions we plan to make.

read point-by-point responses

Referee: [Theoretical Results] Theoretical Results section: the closed-form minimum-rate expressions for Gaussian and binary sources are presented without derivation steps, explicit definitions of the semantic distance function, or the precise optimization problem being solved; this prevents verification that the expressions are independent rather than reparameterizations of the input MDL/IB definitions.

Authors: We appreciate this observation. In the original manuscript, we presented the closed-form expressions directly to focus on the key results, assuming familiarity with rate-distortion theory extensions. However, to allow full verification, we will include the detailed derivation steps, explicit definitions of the semantic distance function (including bit-wise distortion and divergence metrics), and the precise formulation of the optimization problem in the revised Theoretical Results section. This will clarify that the expressions are derived from the joint constraints rather than being mere reparameterizations of the MDL/IB definitions. revision: yes
Referee: [Experiments] Complexity measure and experimental validation: the claim that the adopted MDL/IB complexity correlates with practical DL costs (FLOPs, memory, training time) is load-bearing for the practical relevance of the RDC tradeoff, yet the manuscript provides no quantitative correlation coefficients, sensitivity analysis to alternative complexity definitions, or error bars on the dataset results.

Authors: We agree that quantitative validation of the correlation is important for practical relevance. In the revision, we will add correlation coefficients between the information-theoretic complexity measure and practical metrics such as FLOPs, memory usage, and training time. We will also include error bars on the experimental results and a sensitivity analysis to alternative complexity definitions where feasible. This will strengthen the claim regarding the correlation with DL costs. revision: yes

Circularity Check

0 steps flagged

No significant circularity detected in the RDC derivation chain

full rationale

The paper extends classical rate-distortion theory by adding explicit constraints on semantic distance (bit-wise and divergence-based) and an externally adopted complexity measure from minimum description length and information bottleneck principles. Closed-form minimum-rate expressions are then derived for Gaussian and binary sources under these joint constraints, yielding the claimed three-way tradeoff. This constitutes a standard constrained optimization extension rather than a reduction of the output to the inputs by definition or by self-citation. The complexity measure is imported from established external theory, not fitted or redefined within the paper to force the result. Experiments on image/video datasets supply independent empirical checks. No load-bearing self-citation, ansatz smuggling, or renaming of known results appears in the derivation steps described.

Axiom & Free-Parameter Ledger

2 free parameters · 2 axioms · 0 invented entities

The framework rests on importing complexity definitions from MDL and IB without new independent justification, plus domain assumptions about how semantic distance combines bit-wise and statistical metrics.

free parameters (2)

semantic distance weighting parameters
Weights balancing bit-wise distortion against statistical divergence are introduced to define the composite semantic distance metric.
complexity scaling constants
Scaling factors taken from MDL/IB theory that convert description length into a complexity cost term.

axioms (2)

domain assumption Semantic distance is adequately captured by a linear combination of bit-wise distortion and statistical divergence
Invoked when extending rate-distortion to semantic sources in the abstract.
domain assumption MDL/IB complexity measure is a faithful proxy for deep-learning encoder/decoder computational cost
Adopted without further derivation to incorporate complexity constraints.

pith-pipeline@v0.9.0 · 5519 in / 1375 out tokens · 61830 ms · 2026-05-15T22:05:18.519681+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel (J-cost uniqueness) unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

We adopt I(X;U) as the complexity measure in the RDC framework... derived from the MDL principle and IB (Sec. II-B, Def. 1, Eq. 4)
IndisputableMonolith/Foundation/AlphaCoordinateFixation.lean J_uniquely_calibrated_via_higher_derivative unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

closed-form RG(θd, θp, θc) for Gaussian sources (Theorem 1)

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

48 extracted references · 48 canonical work pages · 2 internal anchors

[1]

Recent contributions to the mathematical theory of com- munication,

W. Weaver, “Recent contributions to the mathematical theory of com- munication,”ETC: A Review of General Semantics, pp. 261–281, 1949

work page 1949
[2]

From semantic communication to semantic-aware networking: Model, architecture, and open problems,

G. Shi, Y . Xiao, Y . Li, and X. Xie, “From semantic communication to semantic-aware networking: Model, architecture, and open problems,” IEEE Commun. Mag., vol. 59, no. 8, pp. 44–50, Aug. 2021

work page 2021
[3]

Rate-distortion-perception theory for semantic communication,

J. Chai, Y . Xiao, G. Shi, and W. Saad, “Rate-distortion-perception theory for semantic communication,” inIEEE ICNP, Reykjavik, Iceland, Oct. 2023, pp. 1–6

work page 2023
[4]

On the rate-distortion theory for task-specific semantic communication,

J. Chai, H. Zhu, Y . Xiao, G. Shi, and P. Zhang, “On the rate-distortion theory for task-specific semantic communication,”Entropy, vol. 27, no. 8, p. 775, 2025

work page 2025
[5]

Deep learning enabled semantic communication systems,

H. Xie, Z. Qin, G. Y . Li, and B.-H. Juang, “Deep learning enabled semantic communication systems,”IEEE Trans. Signal Process., vol. 69, pp. 2663–2675, 2021

work page 2021
[6]

Wireless semantic communi- cations for video conferencing,

P. Jiang, C.-K. Wen, S. Jin, and G. Y . Li, “Wireless semantic communi- cations for video conferencing,”IEEE J. Sel. Areas Commun., vol. 41, no. 1, pp. 230–244, 2022

work page 2022
[7]

Rate-distortion theory for strategic semantic communication,

Y . Xiao, X. Zhang, Y . Li, G. Shi, and T. Bas ¸ar, “Rate-distortion theory for strategic semantic communication,” inIEEE ITW, Mumbai, India, Dec. 2022, pp. 279–284

work page 2022
[8]

An indirect rate-distortion characterization for semantic sources: General model and the case of gaussian observation,

J. Liu, S. Shao, W. Zhang, and H. V . Poor, “An indirect rate-distortion characterization for semantic sources: General model and the case of gaussian observation,”IEEE Trans. Commun., vol. 70, no. 9, pp. 5946– 5959, Sep. 2022

work page 2022
[9]

Imitation learning-based implicit semantic-aware communication networks: Multi-layer represen- tation and collaborative reasoning,

Y . Xiao, Z. Sun, G. Shi, and D. Niyato, “Imitation learning-based implicit semantic-aware communication networks: Multi-layer represen- tation and collaborative reasoning,”IEEE J. Sel. Areas Commun., vol. 41, no. 3, pp. 639–658, Mar. 2023

work page 2023
[10]

Towards agentic AI networking in 6G: A generative foundation model-as-agent approach,

Y . Xiao, G. Shi, and P. Zhang, “Towards agentic AI networking in 6G: A generative foundation model-as-agent approach,”IEEE Commun. Mag., 2024

work page 2024
[11]

An information theoretic tradeoff between complexity and accuracy,

R. Gilad-Bachrach, A. Navot, and N. Tishby, “An information theoretic tradeoff between complexity and accuracy,” inLearning Theory and Kernel Machines: 16th Annual Conference on Learning Theory and 7th Kernel Workshop, COLT/Kernel 2003, Washington, DC, USA, August 24-27, 2003. Proceedings. Springer, 2003, pp. 595–609

work page 2003
[12]

The description length of deep learning models,

L. Blier and Y . Ollivier, “The description length of deep learning models,”NIPS, vol. 31, 2018

work page 2018
[13]

The information bottleneck method

N. Tishby, F. C. Pereira, and W. Bialek, “The information bottleneck method,”arXiv preprint physics/0004057, 2000

work page internal anchor Pith review Pith/arXiv arXiv 2000
[14]

Deep learning and the information bottleneck principle,

N. Tishby and N. Zaslavsky, “Deep learning and the information bottleneck principle,” in2015 IEEE ITW workshop. IEEE, 2015, pp. 1–5

work page 2015
[15]

Measuring the VC-dimension of a learning machine,

V . Vapnik, E. Levin, and Y . Le Cun, “Measuring the VC-dimension of a learning machine,”Neural computation, vol. 6, no. 5, pp. 851–876, 1994

work page 1994
[16]

Degrees of freedom in deep neural networks,

T. Gao and V . Jojic, “Degrees of freedom in deep neural networks,” in Proceedings of the Thirty-Second Conference on Uncertainty in Artificial Intelligence, 2016, pp. 232–241

work page 2016
[17]

Measuring the intrinsic dimension of objective landscapes,

C. Li, H. Farkhoor, R. Liu, and J. Yosinski, “Measuring the intrinsic dimension of objective landscapes,” inICLR, 2018

work page 2018
[18]

Deep residual learning for image recognition,

K. He, X. Zhang, S. Ren, and J. Sun, “Deep residual learning for image recognition,” inCVPR, 2016, pp. 770–778

work page 2016
[19]

Compressing neural networks using the variational information bottleneck,

B. Dai, C. Zhu, B. Guo, and D. Wipf, “Compressing neural networks using the variational information bottleneck,” inICML. PMLR, 2018, pp. 1135–1144

work page 2018
[20]

P. D. Gr ¨unwald,The minimum description length principle. MIT press, 2007

work page 2007
[21]

Keeping the neural networks simple by minimizing the description length of the weights,

G. E. Hinton and D. Van Camp, “Keeping the neural networks simple by minimizing the description length of the weights,” inProceedings of the sixth annual conference on Computational learning theory, 1993, pp. 5–13

work page 1993
[22]

D. J. MacKay,Information theory, inference and learning algorithms. Cambridge university press, 2003

work page 2003
[23]

Deep variational information bottleneck,

A. A. Alemi, I. Fischer, J. V . Dillon, and K. Murphy, “Deep variational information bottleneck,” inICLR, 2017

work page 2017
[24]

The conditional entropy bottleneck,

I. Fischer, “The conditional entropy bottleneck,”Entropy, vol. 22, no. 9, p. 999, 2020

work page 2020
[25]

6G networks: Beyond shannon towards semantic and goal-oriented communications,

E. C. Strinati and S. Barbarossa, “6G networks: Beyond shannon towards semantic and goal-oriented communications,”Computer Networks, vol. 190, p. 107930, 2021

work page 2021
[26]

Semantic-Effectiveness Filtering and Control for Post-5G Wireless Connectivity

P. Popovski, O. Simeone, F. Boccardi, D. Gunduz, and O. Sahin, “Semantic-effectiveness filtering and control for post-5G wireless con- nectivity,”IEEE J. Sel. Areas Commun., vol. 39, no. 8, pp. 2418–2434, 2021, arXiv:1907.02441

work page internal anchor Pith review Pith/arXiv arXiv 2021
[27]

Semantics-empowered communication for networked intelligent systems,

M. Kountouris and N. Pappas, “Semantics-empowered communication for networked intelligent systems,”IEEE Communications Magazine, vol. 59, no. 6, pp. 96–102, 2021

work page 2021
[28]

Reasoning over the air: A reasoning-based implicit semantic-aware communication framework,

Y . Xiao, Y . Liao, Y . Li, G. Shi, H. V . Poor, W. Saad, M. Debbah, and M. Bennis, “Reasoning over the air: A reasoning-based implicit semantic-aware communication framework,”IEEE Trans. Wireless Com- mun., Apr. 2024

work page 2024
[29]

Indirect rate distortion problems,

H. Witsenhausen, “Indirect rate distortion problems,”IEEE Trans. Inf. Theory, vol. 26, no. 5, pp. 518–521, 2003

work page 2003
[30]

The perception-distortion tradeoff,

Y . Blau and T. Michaeli, “The perception-distortion tradeoff,” inCVPR, Salt Lake City, Utah, USA, Jun. 2018, pp. 6228–6237

work page 2018
[31]

Rethinking lossy compression: The rate-distortion-perception tradeoff,

——, “Rethinking lossy compression: The rate-distortion-perception tradeoff,” inICML, Long Beach, CA, USA, Jun. 2019, pp. 675–685

work page 2019
[32]

Universal rate-distortion- perception representations for lossy compression,

G. Zhang, J. Qian, J. Chen, and A. Khisti, “Universal rate-distortion- perception representations for lossy compression,”NIPS, vol. 34, pp. 11 517–11 529, 2021

work page 2021
[33]

On the classification-distortion- perception tradeoff,

D. Liu, H. Zhang, and Z. Xiong, “On the classification-distortion- perception tradeoff,”NIPS, vol. 32, 2019

work page 2019
[34]

Task-oriented lossy compression with data, perception, and classification constraints,

Y . Wang, Y . Wu, S. Ma, and Y .-J. A. Zhang, “Task-oriented lossy compression with data, perception, and classification constraints,”IEEE J. Sel. Areas Commun., vol. 43, no. 7, pp. 2635–2650, July 2025

work page 2025
[35]

Learning task-oriented communication for edge inference: An information bottleneck approach,

J. Shao, Y . Mao, and J. Zhang, “Learning task-oriented communication for edge inference: An information bottleneck approach,”IEEE J. Sel. Areas Commun., vol. 40, no. 1, pp. 197–211, 2021

work page 2021
[36]

Semantic communications based on adaptive generative models and information bottleneck,

S. Barbarossa, D. Comminiello, E. Grassucci, F. Pezone, S. Sardellitti, and P. Di Lorenzo, “Semantic communications based on adaptive generative models and information bottleneck,”IEEE Commun. Mag., vol. 61, no. 11, pp. 36–41, 2023

work page 2023
[37]

Robust information bottleneck for task-oriented communication with digital modulation,

S. Xie, S. Ma, M. Ding, Y . Shi, M. Tang, and Y . Wu, “Robust information bottleneck for task-oriented communication with digital modulation,” IEEE J. Sel. Areas Commun., vol. 41, no. 8, pp. 2577–2591, 2023

work page 2023
[38]

Wasserstein generative adver- sarial networks,

M. Arjovsky, S. Chintala, and L. Bottou, “Wasserstein generative adver- sarial networks,” inICML. PMLR, 2017, pp. 214–223

work page 2017
[39]

K. P. Murphy,Machine Learning: A Probabilistic Perspective. MIT Press, 2012

work page 2012
[40]

Information bottleneck for gaussian variables,

G. Chechik, A. Globerson, N. Tishby, and Y . Weiss, “Information bottleneck for gaussian variables,”NIPS, vol. 16, 2003

work page 2003
[41]

The rate-distortion-perception tradeoff: The role of common randomness,

A. B. Wagner, “The rate-distortion-perception tradeoff: The role of common randomness,”arXiv preprint arXiv:2202.04147, 2022

work page arXiv 2022
[42]

On the information bottleneck problems: Models, connections, applications and information theoretic views,

A. Zaidi, I. Estella-Aguerri, and S. Shamai, “On the information bottleneck problems: Models, connections, applications and information theoretic views,”Entropy, vol. 22, no. 2, p. 151, 2020

work page 2020
[43]

Deep learning classification of land cover and crop types using remote sensing data,

N. Kussul, M. Lavreniuk, S. Skakun, and A. Shelestov, “Deep learning classification of land cover and crop types using remote sensing data,” IEEE Geosci. Remote Sens. Lett., vol. 14, no. 5, pp. 778–782, Mar. 2017

work page 2017
[44]

Dvc: An end-to-end deep video compression framework,

G. Lu, W. Ouyang, D. Xu, X. Zhang, C. Cai, and Z. Gao, “Dvc: An end-to-end deep video compression framework,” inCVPR, 2019, pp. 11 006–11 015

work page 2019
[45]

Deep joint source- channel coding for wireless image transmission,

E. Bourtsoulatze, D. B. Kurka, and D. G ¨und¨uz, “Deep joint source- channel coding for wireless image transmission,”IEEE Trans. Cogn. Commun. Netw., vol. 5, no. 3, pp. 567–579, 2019

work page 2019
[46]

Conditional probability models for deep image compression,

F. Mentzer, E. Agustsson, M. Tschannen, R. Timofte, and L. Van Gool, “Conditional probability models for deep image compression,” inCVPR, 2018, pp. 4394–4402

work page 2018
[47]

Video enhance- ment with task-oriented flow,

T. Xue, B. Chen, J. Wu, D. Wei, and W. T. Freeman, “Video enhance- ment with task-oriented flow,”Int. J. Comput. Vis., vol. 127, pp. 1106– 1125, 2019

work page 2019
[48]

Improved techniques for training GANs,

T. Salimans, I. Goodfellow, W. Zaremba, V . Cheung, A. Radford, and X. Chen, “Improved techniques for training GANs,” inNIPS, Barcelona, Spain, Dec. 2016, pp. 2234–2242

work page 2016

[1] [1]

Recent contributions to the mathematical theory of com- munication,

W. Weaver, “Recent contributions to the mathematical theory of com- munication,”ETC: A Review of General Semantics, pp. 261–281, 1949

work page 1949

[2] [2]

From semantic communication to semantic-aware networking: Model, architecture, and open problems,

G. Shi, Y . Xiao, Y . Li, and X. Xie, “From semantic communication to semantic-aware networking: Model, architecture, and open problems,” IEEE Commun. Mag., vol. 59, no. 8, pp. 44–50, Aug. 2021

work page 2021

[3] [3]

Rate-distortion-perception theory for semantic communication,

J. Chai, Y . Xiao, G. Shi, and W. Saad, “Rate-distortion-perception theory for semantic communication,” inIEEE ICNP, Reykjavik, Iceland, Oct. 2023, pp. 1–6

work page 2023

[4] [4]

On the rate-distortion theory for task-specific semantic communication,

J. Chai, H. Zhu, Y . Xiao, G. Shi, and P. Zhang, “On the rate-distortion theory for task-specific semantic communication,”Entropy, vol. 27, no. 8, p. 775, 2025

work page 2025

[5] [5]

Deep learning enabled semantic communication systems,

H. Xie, Z. Qin, G. Y . Li, and B.-H. Juang, “Deep learning enabled semantic communication systems,”IEEE Trans. Signal Process., vol. 69, pp. 2663–2675, 2021

work page 2021

[6] [6]

Wireless semantic communi- cations for video conferencing,

P. Jiang, C.-K. Wen, S. Jin, and G. Y . Li, “Wireless semantic communi- cations for video conferencing,”IEEE J. Sel. Areas Commun., vol. 41, no. 1, pp. 230–244, 2022

work page 2022

[7] [7]

Rate-distortion theory for strategic semantic communication,

Y . Xiao, X. Zhang, Y . Li, G. Shi, and T. Bas ¸ar, “Rate-distortion theory for strategic semantic communication,” inIEEE ITW, Mumbai, India, Dec. 2022, pp. 279–284

work page 2022

[8] [8]

An indirect rate-distortion characterization for semantic sources: General model and the case of gaussian observation,

J. Liu, S. Shao, W. Zhang, and H. V . Poor, “An indirect rate-distortion characterization for semantic sources: General model and the case of gaussian observation,”IEEE Trans. Commun., vol. 70, no. 9, pp. 5946– 5959, Sep. 2022

work page 2022

[9] [9]

Imitation learning-based implicit semantic-aware communication networks: Multi-layer represen- tation and collaborative reasoning,

Y . Xiao, Z. Sun, G. Shi, and D. Niyato, “Imitation learning-based implicit semantic-aware communication networks: Multi-layer represen- tation and collaborative reasoning,”IEEE J. Sel. Areas Commun., vol. 41, no. 3, pp. 639–658, Mar. 2023

work page 2023

[10] [10]

Towards agentic AI networking in 6G: A generative foundation model-as-agent approach,

Y . Xiao, G. Shi, and P. Zhang, “Towards agentic AI networking in 6G: A generative foundation model-as-agent approach,”IEEE Commun. Mag., 2024

work page 2024

[11] [11]

An information theoretic tradeoff between complexity and accuracy,

R. Gilad-Bachrach, A. Navot, and N. Tishby, “An information theoretic tradeoff between complexity and accuracy,” inLearning Theory and Kernel Machines: 16th Annual Conference on Learning Theory and 7th Kernel Workshop, COLT/Kernel 2003, Washington, DC, USA, August 24-27, 2003. Proceedings. Springer, 2003, pp. 595–609

work page 2003

[12] [12]

The description length of deep learning models,

L. Blier and Y . Ollivier, “The description length of deep learning models,”NIPS, vol. 31, 2018

work page 2018

[13] [13]

The information bottleneck method

N. Tishby, F. C. Pereira, and W. Bialek, “The information bottleneck method,”arXiv preprint physics/0004057, 2000

work page internal anchor Pith review Pith/arXiv arXiv 2000

[14] [14]

Deep learning and the information bottleneck principle,

N. Tishby and N. Zaslavsky, “Deep learning and the information bottleneck principle,” in2015 IEEE ITW workshop. IEEE, 2015, pp. 1–5

work page 2015

[15] [15]

Measuring the VC-dimension of a learning machine,

V . Vapnik, E. Levin, and Y . Le Cun, “Measuring the VC-dimension of a learning machine,”Neural computation, vol. 6, no. 5, pp. 851–876, 1994

work page 1994

[16] [16]

Degrees of freedom in deep neural networks,

T. Gao and V . Jojic, “Degrees of freedom in deep neural networks,” in Proceedings of the Thirty-Second Conference on Uncertainty in Artificial Intelligence, 2016, pp. 232–241

work page 2016

[17] [17]

Measuring the intrinsic dimension of objective landscapes,

C. Li, H. Farkhoor, R. Liu, and J. Yosinski, “Measuring the intrinsic dimension of objective landscapes,” inICLR, 2018

work page 2018

[18] [18]

Deep residual learning for image recognition,

K. He, X. Zhang, S. Ren, and J. Sun, “Deep residual learning for image recognition,” inCVPR, 2016, pp. 770–778

work page 2016

[19] [19]

Compressing neural networks using the variational information bottleneck,

B. Dai, C. Zhu, B. Guo, and D. Wipf, “Compressing neural networks using the variational information bottleneck,” inICML. PMLR, 2018, pp. 1135–1144

work page 2018

[20] [20]

P. D. Gr ¨unwald,The minimum description length principle. MIT press, 2007

work page 2007

[21] [21]

Keeping the neural networks simple by minimizing the description length of the weights,

G. E. Hinton and D. Van Camp, “Keeping the neural networks simple by minimizing the description length of the weights,” inProceedings of the sixth annual conference on Computational learning theory, 1993, pp. 5–13

work page 1993

[22] [22]

D. J. MacKay,Information theory, inference and learning algorithms. Cambridge university press, 2003

work page 2003

[23] [23]

Deep variational information bottleneck,

A. A. Alemi, I. Fischer, J. V . Dillon, and K. Murphy, “Deep variational information bottleneck,” inICLR, 2017

work page 2017

[24] [24]

The conditional entropy bottleneck,

I. Fischer, “The conditional entropy bottleneck,”Entropy, vol. 22, no. 9, p. 999, 2020

work page 2020

[25] [25]

6G networks: Beyond shannon towards semantic and goal-oriented communications,

E. C. Strinati and S. Barbarossa, “6G networks: Beyond shannon towards semantic and goal-oriented communications,”Computer Networks, vol. 190, p. 107930, 2021

work page 2021

[26] [26]

Semantic-Effectiveness Filtering and Control for Post-5G Wireless Connectivity

P. Popovski, O. Simeone, F. Boccardi, D. Gunduz, and O. Sahin, “Semantic-effectiveness filtering and control for post-5G wireless con- nectivity,”IEEE J. Sel. Areas Commun., vol. 39, no. 8, pp. 2418–2434, 2021, arXiv:1907.02441

work page internal anchor Pith review Pith/arXiv arXiv 2021

[27] [27]

Semantics-empowered communication for networked intelligent systems,

M. Kountouris and N. Pappas, “Semantics-empowered communication for networked intelligent systems,”IEEE Communications Magazine, vol. 59, no. 6, pp. 96–102, 2021

work page 2021

[28] [28]

Reasoning over the air: A reasoning-based implicit semantic-aware communication framework,

Y . Xiao, Y . Liao, Y . Li, G. Shi, H. V . Poor, W. Saad, M. Debbah, and M. Bennis, “Reasoning over the air: A reasoning-based implicit semantic-aware communication framework,”IEEE Trans. Wireless Com- mun., Apr. 2024

work page 2024

[29] [29]

Indirect rate distortion problems,

H. Witsenhausen, “Indirect rate distortion problems,”IEEE Trans. Inf. Theory, vol. 26, no. 5, pp. 518–521, 2003

work page 2003

[30] [30]

The perception-distortion tradeoff,

Y . Blau and T. Michaeli, “The perception-distortion tradeoff,” inCVPR, Salt Lake City, Utah, USA, Jun. 2018, pp. 6228–6237

work page 2018

[31] [31]

Rethinking lossy compression: The rate-distortion-perception tradeoff,

——, “Rethinking lossy compression: The rate-distortion-perception tradeoff,” inICML, Long Beach, CA, USA, Jun. 2019, pp. 675–685

work page 2019

[32] [32]

Universal rate-distortion- perception representations for lossy compression,

G. Zhang, J. Qian, J. Chen, and A. Khisti, “Universal rate-distortion- perception representations for lossy compression,”NIPS, vol. 34, pp. 11 517–11 529, 2021

work page 2021

[33] [33]

On the classification-distortion- perception tradeoff,

D. Liu, H. Zhang, and Z. Xiong, “On the classification-distortion- perception tradeoff,”NIPS, vol. 32, 2019

work page 2019

[34] [34]

Task-oriented lossy compression with data, perception, and classification constraints,

Y . Wang, Y . Wu, S. Ma, and Y .-J. A. Zhang, “Task-oriented lossy compression with data, perception, and classification constraints,”IEEE J. Sel. Areas Commun., vol. 43, no. 7, pp. 2635–2650, July 2025

work page 2025

[35] [35]

Learning task-oriented communication for edge inference: An information bottleneck approach,

J. Shao, Y . Mao, and J. Zhang, “Learning task-oriented communication for edge inference: An information bottleneck approach,”IEEE J. Sel. Areas Commun., vol. 40, no. 1, pp. 197–211, 2021

work page 2021

[36] [36]

Semantic communications based on adaptive generative models and information bottleneck,

S. Barbarossa, D. Comminiello, E. Grassucci, F. Pezone, S. Sardellitti, and P. Di Lorenzo, “Semantic communications based on adaptive generative models and information bottleneck,”IEEE Commun. Mag., vol. 61, no. 11, pp. 36–41, 2023

work page 2023

[37] [37]

Robust information bottleneck for task-oriented communication with digital modulation,

S. Xie, S. Ma, M. Ding, Y . Shi, M. Tang, and Y . Wu, “Robust information bottleneck for task-oriented communication with digital modulation,” IEEE J. Sel. Areas Commun., vol. 41, no. 8, pp. 2577–2591, 2023

work page 2023

[38] [38]

Wasserstein generative adver- sarial networks,

M. Arjovsky, S. Chintala, and L. Bottou, “Wasserstein generative adver- sarial networks,” inICML. PMLR, 2017, pp. 214–223

work page 2017

[39] [39]

K. P. Murphy,Machine Learning: A Probabilistic Perspective. MIT Press, 2012

work page 2012

[40] [40]

Information bottleneck for gaussian variables,

G. Chechik, A. Globerson, N. Tishby, and Y . Weiss, “Information bottleneck for gaussian variables,”NIPS, vol. 16, 2003

work page 2003

[41] [41]

The rate-distortion-perception tradeoff: The role of common randomness,

A. B. Wagner, “The rate-distortion-perception tradeoff: The role of common randomness,”arXiv preprint arXiv:2202.04147, 2022

work page arXiv 2022

[42] [42]

On the information bottleneck problems: Models, connections, applications and information theoretic views,

A. Zaidi, I. Estella-Aguerri, and S. Shamai, “On the information bottleneck problems: Models, connections, applications and information theoretic views,”Entropy, vol. 22, no. 2, p. 151, 2020

work page 2020

[43] [43]

Deep learning classification of land cover and crop types using remote sensing data,

N. Kussul, M. Lavreniuk, S. Skakun, and A. Shelestov, “Deep learning classification of land cover and crop types using remote sensing data,” IEEE Geosci. Remote Sens. Lett., vol. 14, no. 5, pp. 778–782, Mar. 2017

work page 2017

[44] [44]

Dvc: An end-to-end deep video compression framework,

G. Lu, W. Ouyang, D. Xu, X. Zhang, C. Cai, and Z. Gao, “Dvc: An end-to-end deep video compression framework,” inCVPR, 2019, pp. 11 006–11 015

work page 2019

[45] [45]

Deep joint source- channel coding for wireless image transmission,

E. Bourtsoulatze, D. B. Kurka, and D. G ¨und¨uz, “Deep joint source- channel coding for wireless image transmission,”IEEE Trans. Cogn. Commun. Netw., vol. 5, no. 3, pp. 567–579, 2019

work page 2019

[46] [46]

Conditional probability models for deep image compression,

F. Mentzer, E. Agustsson, M. Tschannen, R. Timofte, and L. Van Gool, “Conditional probability models for deep image compression,” inCVPR, 2018, pp. 4394–4402

work page 2018

[47] [47]

Video enhance- ment with task-oriented flow,

T. Xue, B. Chen, J. Wu, D. Wei, and W. T. Freeman, “Video enhance- ment with task-oriented flow,”Int. J. Comput. Vis., vol. 127, pp. 1106– 1125, 2019

work page 2019

[48] [48]

Improved techniques for training GANs,

T. Salimans, I. Goodfellow, W. Zaremba, V . Cheung, A. Radford, and X. Chen, “Improved techniques for training GANs,” inNIPS, Barcelona, Spain, Dec. 2016, pp. 2234–2242

work page 2016