Summary of Genai Content Detection Task 2: Ai Vs. Human — Academic Essay Authenticity Challenge, by Shammur Absar Chowdhury et al.
GenAI Content Detection Task 2: AI vs. Human – Academic Essay Authenticity Challenge
by Shammur Absar Chowdhury, Hind Almerekhi, Mucahid Kutlu, Kaan Efe Keles, Fatema Ahmad, Tasnim Mohiuddin, George Mikros, Firoj Alam
First submitted to arxiv on: 24 Dec 2024
Categories
- Main: Computation and Language (cs.CL)
- Secondary: Artificial Intelligence (cs.AI)
GrooveSquid.com Paper Summaries
GrooveSquid.com’s goal is to make artificial intelligence research accessible by summarizing AI papers in simpler terms. Each summary below covers the same AI paper, written at different levels of difficulty. The medium difficulty and low difficulty versions are original summaries written by GrooveSquid.com, while the high difficulty version is the paper’s original abstract. Feel free to learn from the version that suits you best!
Summary difficulty | Written by | Summary |
---|---|---|
High | Paper authors | High Difficulty Summary Read the original abstract here |
Medium | GrooveSquid.com (original content) | Medium Difficulty Summary The paper presents an overview of the Academic Essay Authenticity Challenge, a shared task organized as part of GenAI Content Detection at COLING 2025. The challenge aims to detect whether essays are machine-generated or human-authored for academic purposes. The task involves identifying whether an essay is generated by a machine or authored by a human. The challenge is conducted in two languages: English and Arabic. A total of 25 teams submitted systems for English, while 21 teams participated in the Arabic track. Additionally, seven teams submitted system description papers. The majority of submissions utilized fine-tuned transformer-based models, with one team employing Large Language Models (LLMs) such as Llama 2 and Llama 3. This paper outlines the task formulation, dataset construction process, evaluation framework, and summarizes approaches adopted by participating teams. |
Low | GrooveSquid.com (original content) | Low Difficulty Summary The challenge is about detecting whether an essay was written by a machine or a person for academic purposes. The challenge involves two languages: English and Arabic. Many teams took part in this challenge and used special models to analyze the essays. Most of these models were based on transformers, which are a type of AI model. One team even used super powerful language models called Llama 2 and Llama 3. This paper tells us about the challenge, how it was set up, and what different teams did. |
Keywords
» Artificial intelligence » Llama » Transformer