구형준 교수 연구실 | 성균관대 소프트웨어학과

발행물

전체 논문

111

LLM Self Defense: By Self Examination, LLMs Know They Are Being Tricked

Arxiv, 2024

Meta Large Language Model Compiler: Foundation Models of Compiler Optimization

Arxiv, 2024

Unlearning Bias in Language Models by Partitioning Gradients

ACL, 2023

ProPILE: Probing Privacy Leakage in Large Language Models

NeurIPS, 2024

Improving Real-world Password Guessing Attacks via Bi-directional Transformers

USENIX, 2023

Semantic Ranking for Automated Adversarial Technique Annotation in Security Text

ASIA CCS, 2024

LogBERT: Log Anomaly Detection via BERT

IJCNN, 2021

Universal and Transferable Adversarial Attacks on Aligned Language Models

arxiv, 2023

Membership Inference via Backdooring

IJCAI, 2022

Llama Guard: LLM-based Input-Output Safeguard for Human-AI Conversations

Meta, 2023