PLUS Research Group
PLUS Research Group
Home
News
People
Publications
Contact
Light
Dark
Automatic
Safeguarding Language Models via Self-Destruct Trapdoor
Shahar Katz
,
Bar Alon
,
Ariel Shaulov
,
Lior Wolf
,
Mahmood Sharif
February 2026
Cite
Abstract
TBA.
Type
Conference paper
Publication
Conference of the European Chapter of the Association for Computational Linguistics (EACL)
Cite
×