Publications
† First author; †* Co-first author (marked with *)
To Appear / Published
DITTO: A Spoofing Attack Framework on Watermarked LLMs via Knowledge Distillation
EACL 2026 (Main Conference), to appear.
Marking Code Without Breaking It: Code Watermarking for Detecting LLM-Generated Code
Findings of EACL 2026, to appear.
* Equal contribution
WaterMod: Modular Token-Rank Partitioning for Probability-Balanced LLM Watermarking
AAAI 2026 (Oral), to appear.
EnCur: Curriculum-Based In-Context Learning with Structural Encoding for Code Time Complexity Prediction
Expert Systems with Applications, Vol. 296, 129094, January 2026.
Detecting Code Paraphrased by Large Language Models using Coding Style Features
Engineering Applications of Artificial Intelligence, Vol. 162, December 2025.
Mondrian: A Framework for Logical Abstract (Re)Structuring
EMNLP 2025 (Main Conference), pp. 33663–33678.
TrapDoc: Deceiving LLM Users by Injecting Imperceptible Phantom Tokens into Documents
Findings of EMNLP 2025, pp. 18881–18897.
Advanced Code Time Complexity Prediction Approach Using Contrastive Learning
Engineering Applications of Artificial Intelligence, Vol. 151, July 2025.
KatFishNet: Detecting LLM-Generated Korean Text through Linguistic Feature Analysis
ACL 2025 (Main Conference), pp. 21189–21222.
ConPrompt: Pre-training a Language Model with Machine-Generated Data for Implicit Hate Speech Detection
Findings of EMNLP 2023, pp. 10964–10980.
Contrastive Learning with Keyword-based Data Augmentation for Code Search and Code Question Answering
EACL 2023 (Main Conference), pp. 3609–3619.
Generalizable Implicit Hate Speech Detection using Contrastive Learning
COLING 2022, pp. 6667–6679.
Under Review
From Intuition to Expertise: Rubric-Based Cognitive Calibration for Human Detection of LLM-Generated Korean Text
Steering Language Models Before They Speak: Logit-Level Interventions
A Linguistics-Aware LLM Watermarking via Syntactic Predictability
Select then MixUp: Improving Out-of-Distribution Natural Language Code Search