Publications
† First author †* Co-first author (marked with *)
To Appear / Published
Detecting Code Paraphrased by Large Language Models using Coding Style Features
Engineering Applications of Artificial Intelligence, to appear.
TrapDoc: Deceiving LLM Users by Injecting Imperceptible Phantom Tokens into Documents
Findings of EMNLP, to appear, 2025.
Mondrian: A Framework for Logical Abstract (Re)Structuring
EMNLP (Main Conference), to appear, 2025.
EnCur: Curriculum-Based In-Context Learning with Structural Encoding for Code Time Complexity Prediction
Expert Systems with Applications, to appear.
Advanced Code Time Complexity Prediction Approach Using Contrastive Learning
Engineering Applications of Artificial Intelligence, Vol. 151, July 2025.
KatFishNet: Detecting LLM-Generated Korean Text through Linguistic Feature Analysis
ACL 2025 (Main Conference), pp. 21189–21222.
ConPrompt: Pre-training a Language Model with Machine-Generated Data for Implicit Hate Speech Detection
Findings of EMNLP 2023, pp. 10964–10980.
Contrastive Learning with Keyword-based Data Augmentation for Code Search and Code Question Answering
EACL 2023, pp. 3609–3619.
Generalizable Implicit Hate Speech Detection using Contrastive Learning
COLING 2022, pp. 6667–6679.
Under Review
WaterMod: Modular Token-Rank Partitioning for Probability-Balanced LLM Watermarking
Marking Code Without Breaking It: Code Watermarking for Detecting LLM-Generated Code
* Equal contribution
Select then MixUp: Improving Out-of-Distribution Natural Language Code Search
Medea: A Test for Rationality in Artificial Intelligence Systems