Professor 구형준's Lab
Publications
All papers: 111
71
LLM Self Defense: By Self Examination, LLMs Know They Are Being Tricked
arXiv, 2024
72
Meta Large Language Model Compiler: Foundation Models of Compiler Optimization
arXiv, 2024
73
Unlearning Bias in Language Models by Partitioning Gradients
ACL, 2023
74
ProPILE: Probing Privacy Leakage in Large Language Models
NeurIPS, 2024
75
Improving Real-world Password Guessing Attacks via Bi-directional Transformers
USENIX, 2023
76
Semantic Ranking for Automated Adversarial Technique Annotation in Security Text
ASIA CCS, 2024
77
LogBERT: Log Anomaly Detection via BERT
IJCNN, 2021
78
Universal and Transferable Adversarial Attacks on Aligned Language Models
arXiv, 2023
79
Membership Inference via Backdooring
IJCAI, 2022
80
Llama Guard: LLM-based Input-Output Safeguard for Human-AI Conversations
Meta, 2023