Soraya Partow

Research output

Publications

Each publication is listed with its current status. Manuscripts under review and work in preparation are labeled as such and are not presented as accepted. For BibTeX entries, preprints, or further details, please get in touch.

Accepted

  • Values Alignment and Stability Tracking in Autonomous Agents (VAST): A Framework for Monitoring Moral Drift

    Accepted

    Soraya Partow, et al.

    IEEE ICAD 2026 — IEEE International Conference on Autonomous Decision-Making, 2026

  • Auditable On-Chain Governance for Autonomous AI Agents: A VAST-Blockchain Approach

    Accepted

    Soraya Partow, et al.

    IEEE ICAD 2026 — IEEE International Conference on Autonomous Decision-Making, 2026

  • Behavioral Game-Theoretic Defense Strategies for IoT Security under Bounded Rationality

    Accepted

    Soraya Partow, et al.

    IEEE ICC 2026 — IEEE International Conference on Communications, 2026

Under review

  • Multi-Level Strategic Defense against Hardware Trojans: A Game-Theoretic Perspective

    Under review

    Soraya Partow, et al.

    IEEE IMNS 2026 — IEEE International Conference on Microelectronics and Nanoscale Systems, 2026

    Manuscript submitted.

  • COALITION-VAST: Auditable Multi-Agent Alignment Under Byzantine Governance

    Under review

    Soraya Partow, Satyaki Nan

    IEEE IMNS 2026 — IEEE International Conference on Microelectronics and Nanoscale Systems, 2026

    Manuscript submitted.

  • When Safe-Looking Models Fail: Exposing a Hidden Decision Gap in Multi-Turn Conversational Safety

    Under review

    Soraya Partow, Satyaki Nan

    IEEE IMNS 2026 — IEEE International Conference on Microelectronics and Nanoscale Systems, 2026

    Manuscript submitted.

In preparation

  • Who Grades the Graders? Validating Automated Evaluators in Adversarial AI Safety Benchmarks

    In preparation

    Soraya Partow, et al.

    NeurIPS 2026 (target), 2026

    Paper in preparation; target venue only.