Research output
Publications
Publications are listed with their current status. Manuscripts under review and work in preparation are labeled as such and are not presented as accepted. For BibTeX, preprints, or further details, please get in touch.
Accepted
Values Alignment and Stability Tracking in Autonomous Agents (VAST): A Framework for Monitoring Moral Drift
Accepted | Soraya Partow, et al.
IEEE ICAD 2026 — IEEE International Conference on Autonomous Decision-Making, 2026
Auditable On-Chain Governance for Autonomous AI Agents: A VAST-Blockchain Approach
Accepted | Soraya Partow, et al.
IEEE ICAD 2026 — IEEE International Conference on Autonomous Decision-Making, 2026
Behavioral Game-Theoretic Defense Strategies for IoT Security under Bounded Rationality
Accepted | Soraya Partow, et al.
IEEE ICC 2026 — IEEE International Conference on Communications, 2026
Under review
Multi-Level Strategic Defense against Hardware Trojans: A Game-Theoretic Perspective
Under review | Soraya Partow, et al.
IEEE IMNS 2026 — IEEE International Conference on Microelectronics and Nanoscale Systems, 2026
Manuscript submitted.
COALITION-VAST: Auditable Multi-Agent Alignment Under Byzantine Governance
Under review | Soraya Partow, Satyaki Nan
IEEE IMNS 2026 — IEEE International Conference on Microelectronics and Nanoscale Systems, 2026
Manuscript submitted.
When Safe-Looking Models Fail: Exposing a Hidden Decision Gap in Multi-Turn Conversational Safety
Under review | Soraya Partow, Satyaki Nan
IEEE IMNS 2026 — IEEE International Conference on Microelectronics and Nanoscale Systems, 2026
Manuscript submitted.
In preparation
Who Grades the Graders? Validating Automated Evaluators in Adversarial AI Safety Benchmarks
In preparation | Soraya Partow, et al.
NeurIPS 2026 (target), 2026
Paper in preparation; target venue only.