In Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics, pages 139–151

Yingzhou Lu, Minjie Shen, Huazheng Wang, Xiao Wang, Capucine van Rechem, Tianfan Fu, Wenqi Wei · 2023 · arXiv 2302.04062

11 Pith papers cite this work. Polarity classification is still indexing.


citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

When Does Model Collapse Occur in Structured Interactive Learning?

cs.LG · 2026-05-19 · unverdicted · novelty 7.0

Model collapse occurs in structured interactive learning if and only if the directed interaction graph satisfies a specific topological condition, with finite-sample guarantees for linear regression and asymptotic results for M-estimators.

HopWeaver: Cross-Document Synthesis of High-Quality and Authentic Multi-Hop Questions

cs.CL · 2025-05-21 · unverdicted · novelty 7.0

HopWeaver automatically synthesizes authentic bridge and comparison multi-hop questions from cross-document sources via a pipeline that identifies complementary documents and builds reasoning paths.

SADGE: Structure and Appearance Domain Gap Estimation of Synthetic and Real Data

cs.CV · 2026-05-21 · unverdicted · novelty 6.0

SADGE is a new fused similarity metric combining DINOv3 appearance and MASt3R geometry via constrained bilinear interaction that correlates with downstream synthetic-to-real performance at Pearson r=0.88 across multiple benchmarks.

What Makes Synthetic Data Effective in Image Segmentation

cs.CV · 2026-05-19 · unverdicted · novelty 6.0

Dense scene composition and instance fidelity in synthetic diffusion images drive better segmentation performance; SENSE framework exploits this to improve models on Cityscapes, COCO, and ADE20K.

Firefly: Illuminating Large-Scale Verified Tool-Call Data Generation from Real APIs

cs.SE · 2026-05-17 · unverdicted · novelty 6.0

FireFly inverts task synthesis by exploring real MCP servers first via pairwise tool graphs and sub-DAG sampling, then generates 5,144 verified tasks backward from outcomes to train a 4B model that matches Claude Sonnet 4.6 on tool-calling benchmarks.

Generative AI-Based Monte Carlo Simulation for Method Evaluation Using Synthetic Multilevel Data

stat.ME · 2026-05-07 · unverdicted · novelty 6.0

A framework using generative AI to produce synthetic multilevel data for Monte Carlo simulations that evaluate the performance and parameter recovery of quantitative methods.

Stochastic dynamics learning with state-space systems

stat.ML · 2025-08-11 · unverdicted · novelty 6.0

Establishes that fading memory and solution stability hold generically in state-space systems for reservoir computing even without the echo state property, with a distributional attractor perspective for stochastic cases.

CoX-MoE: Coalesced Expert Execution for High-Throughput MoE Inference with AMX-Enabled CPU-GPU Co-Execution

cs.LG · 2026-05-18 · unverdicted · novelty 5.0

CoX-MoE achieves up to 7.1x higher throughput than FlexGen for MoE inference via coalesced expert execution and AMX-enabled CPU-GPU orchestration with static expert stratification.

Fundamental Trade-Offs in Multi-Bit Watermarking of Stochastic Processes

cs.IT · 2026-05-09 · unverdicted · novelty 5.0

Derives matched converse and achievability bounds that characterize optimal trade-offs among false-alarm probability, detection error probability, distortion, and information rate for multi-bit watermarking of stationary ergodic stochastic processes.

Self-Improving Tabular Language Models via Iterative Reward-Guided Post-Training

cs.LG · 2026-04-21 · unverdicted · novelty 5.0

TabGRAA applies group-relative advantage alignment in an iterative reward-guided post-training loop to improve tabular language model generators on fidelity, utility, and privacy trade-offs across five benchmarks.

Can Synthetic Data be Fair and Private? A Comparative Study of Synthetic Data Generation and Fairness Algorithms

cs.LG · 2025-01-03 · unverdicted · novelty 5.0

DECAF synthetic data generator best balances privacy and fairness while fairness pre-processing improves outcomes more on synthetic data than real data, though at some cost to predictive accuracy.

citing papers explorer

Showing 11 of 11 citing papers.

When Does Model Collapse Occur in Structured Interactive Learning? · cs.LG · 2026-05-19 · unverdicted · none · ref 21
HopWeaver: Cross-Document Synthesis of High-Quality and Authentic Multi-Hop Questions · cs.CL · 2025-05-21 · unverdicted · none · ref 3
SADGE: Structure and Appearance Domain Gap Estimation of Synthetic and Real Data · cs.CV · 2026-05-21 · unverdicted · none · ref 25
What Makes Synthetic Data Effective in Image Segmentation · cs.CV · 2026-05-19 · unverdicted · none · ref 11
Firefly: Illuminating Large-Scale Verified Tool-Call Data Generation from Real APIs · cs.SE · 2026-05-17 · unverdicted · none · ref 6
Generative AI-Based Monte Carlo Simulation for Method Evaluation Using Synthetic Multilevel Data · stat.ME · 2026-05-07 · unverdicted · none · ref 36
Stochastic dynamics learning with state-space systems · stat.ML · 2025-08-11 · unverdicted · none · ref 58
CoX-MoE: Coalesced Expert Execution for High-Throughput MoE Inference with AMX-Enabled CPU-GPU Co-Execution · cs.LG · 2026-05-18 · unverdicted · none · ref 18
Fundamental Trade-Offs in Multi-Bit Watermarking of Stochastic Processes · cs.IT · 2026-05-09 · unverdicted · none · ref 6
Self-Improving Tabular Language Models via Iterative Reward-Guided Post-Training · cs.LG · 2026-04-21 · unverdicted · none · ref 51
Can Synthetic Data be Fair and Private? A Comparative Study of Synthetic Data Generation and Fairness Algorithms · cs.LG · 2025-01-03 · unverdicted · none · ref 41
