Maksym Andriushchenko
Identifiers
- name variant Maksym Andriushchenko 0.60 · backfill
Papers (15)
- What Shapes Emergent Misalignment? Insights from Training Dynamics, Model Priors, and Data cs.AI · 2026 · author #4
- Decomposing and Measuring Evaluation Awareness cs.LG · 2026 · author #6
- FutureSim: Replaying World Events to Evaluate Adaptive Agents cs.LG · 2026 · author #7
- Europe and the Geopolitics of AGI: The Need for a Preparedness Plan cs.CY · 2026 · author #11
- Instrumental Choices: Measuring the Propensity of LLM Agents to Pursue Instrumental Behaviors cs.AI · 2026 · author #3
- Characterizing the Consistency of the Emergent Misalignment Persona cs.AI · 2026 · author #3
- QuantSightBench: Evaluating LLM Quantitative Forecasting with Prediction Intervals cs.LG · 2026 · author #2
- Claudini: Autoresearch Discovers State-of-the-Art Adversarial Attack Algorithms for LLMs cs.LG · 2026 · author #6
- Skill-Inject: Measuring Agent Vulnerability to Skill File Attacks cs.CR · 2026 · author #4
- Helpful to a Fault: Measuring Illicit Assistance in Multi-Turn, Multilingual LLM Agents cs.CL · 2026 · author #4
- AgentHarm: A Benchmark for Measuring Harmfulness of LLM Agents cs.LG · 2024 · author #1
- JailbreakBench: An Open Robustness Benchmark for Jailbreaking Large Language Models cs.CR · 2024 · author #4
- Why ReLU networks yield high-confidence predictions far away from the training data and how to mitigate the problem cs.LG · 2018 · author #2
- Logit Pairing Methods Can Fool Gradient-Based Attacks cs.LG · 2018 · author #2
- Formal Guarantees on the Robustness of a Classifier against Adversarial Manipulation cs.LG · 2017 · author #2
Mentions
- 2606.20814 #4 · arxiv_oai · confidence 0.70 Maksym Andriushchenko
- 2603.24511 #6 · arxiv_oai · confidence 0.70 Maksym Andriushchenko
- 2605.23055 #6 · arxiv_oai · confidence 0.70 Maksym Andriushchenko
- 2602.16346 #4 · arxiv_oai · confidence 0.70 Maksym Andriushchenko
- 2602.20156 #4 · arxiv_oai · confidence 0.70 Maksym Andriushchenko
Frequent Coauthors
- Matthias Hein 3 shared papers
- Anietta Weckauff 2 shared papers
- Jonas Geiping 2 shared papers
- Sahar Abdelnabi 2 shared papers
- Yuchen Zhang 2 shared papers
- Afek Shamir 1 shared papers
- Alexander Panfilov 1 shared papers
- Alexander Robey 1 shared papers
- Alexandra Souly 1 shared papers
- Ameya Prabhu 1 shared papers
- Andy Zou 1 shared papers
- Antoine Bosselut 1 shared papers
- Arvindh Arun 1 shared papers
- Ayush K Tarun 1 shared papers
- Beng\"usu \"Ozcan 1 shared papers
- Changling Li 1 shared papers
- Daan Juijn 1 shared papers
- Dan Hendrycks 1 shared papers
- David Jank\r{u} 1 shared papers
- David Schmotz 1 shared papers