PEACE: Cross-Platform Hate Speech Detection- A Causality-guided Framework Paper • 2306.08804 • Published Jun 15, 2023
Causality Guided Disentanglement for Cross-Platform Hate Speech Detection Paper • 2308.02080 • Published Aug 3, 2023
J-Guard: Journalism Guided Adversarially Robust Detection of AI-generated News Paper • 2309.03164 • Published Sep 6, 2023
ConDA: Contrastive Domain Adaptation for AI-generated Text Detection Paper • 2309.03992 • Published Sep 7, 2023
Harnessing Artificial Intelligence to Combat Online Hate: Exploring the Challenges and Opportunities of Large Language Models in Hate Speech Detection Paper • 2403.08035 • Published Mar 12, 2024
A Survey of AI-generated Text Forensic Systems: Detection, Attribution, and Characterization Paper • 2403.01152 • Published Mar 2, 2024
Cross-Platform Hate Speech Detection with Weakly Supervised Causal Disentanglement Paper • 2404.11036 • Published Apr 17, 2024
Mindful-RAG: A Study of Points of Failure in Retrieval Augmented Generation Paper • 2407.12216 • Published Jul 16, 2024
Stylometric Detection of AI-Generated Text in Twitter Timelines Paper • 2303.03697 • Published Mar 7, 2023
Towards Safety Reasoning in LLMs: AI-agentic Deliberation for Policy-embedded CoT Data Creation Paper • 2505.21784 • Published May 27 • 17