Safeguarding Language Models via Self-Destruct Trapdoor

Abstract

TBA.

Publication
Conference of the European Chapter of the Association for Computational Linguistics (EACL)