publications

2026

  1. Sparse Models, Sparse Safety: Unsafe Routes in Mixture-of-Experts LLMs
    Yukun Jiang, Hai Huang, Mingjie Li, Yage Zhang, Michael Backes, and Yang Zhang
    In International Conference on Machine Learning (ICML), 2026
  2. HarmfulSkillBench: How Do Harmful Skills Weaponize Your Agents?
    Yukun Jiang, Yage Zhang, Michael Backes, Xinyue Shen, and Yang Zhang
    arXiv preprint, 2026
  3. DE-CLIP: Few-Shot Anomaly Detection via Difference-Guided Embedding Editing
    Yage Zhang, Yukun Jiang, Michael Backes, and Yang Zhang
    In Annual Meeting of the Association for Computational Linguistics (ACL), 2026
  4. Real Money, Fake Models: Deceptive Model Claims in Shadow APIs
    Yage Zhang, Yukun Jiang, Zeyuan Chen, Michael Backes, Xinyue Shen, and Yang Zhang
    arXiv preprint, 2026
  5. “Humans welcome to observe”: A First Look at the Agent Social Network Moltbook
    Yukun Jiang*, Yage Zhang*, Xinyue Shen*, Michael Backes, and Yang Zhang
    arXiv preprint, 2026

2025

  1. Cutting the Root of Hallucination: Structural Trimming for Vulnerability Mitigation in Code LLMs
    Yage Zhang
    In Conference on Language Modeling (COLM), 2025
  2. A Machine Learning Dataset of Artificial Inner Ring Damages on Cylindrical Roller Bearings Measured Under Varying Cross-Influences
    Schnur Christopher, Goodarzi Payman, Robin Yannick, Schneider Tizian, Schauer Julian, El Moutaouakil Houssam, Morsch Jannis, Ahmad Ali Ali, Yage Zhang, and Schütze Andreas
    2025
    Available from Lab for Measurement Technology, Saarland University