Loading Now

Summary of Genai Content Detection Task 2: Ai Vs. Human — Academic Essay Authenticity Challenge, by Shammur Absar Chowdhury et al.


GenAI Content Detection Task 2: AI vs. Human – Academic Essay Authenticity Challenge

by Shammur Absar Chowdhury, Hind Almerekhi, Mucahid Kutlu, Kaan Efe Keles, Fatema Ahmad, Tasnim Mohiuddin, George Mikros, Firoj Alam

First submitted to arxiv on: 24 Dec 2024

Categories

  • Main: Computation and Language (cs.CL)
  • Secondary: Artificial Intelligence (cs.AI)

     Abstract of paper      PDF of paper


GrooveSquid.com Paper Summaries

GrooveSquid.com’s goal is to make artificial intelligence research accessible by summarizing AI papers in simpler terms. Each summary below covers the same AI paper, written at different levels of difficulty. The medium difficulty and low difficulty versions are original summaries written by GrooveSquid.com, while the high difficulty version is the paper’s original abstract. Feel free to learn from the version that suits you best!

Summary difficulty Written by Summary
High Paper authors High Difficulty Summary
Read the original abstract here
Medium GrooveSquid.com (original content) Medium Difficulty Summary
The paper presents an overview of the Academic Essay Authenticity Challenge, a shared task organized as part of GenAI Content Detection at COLING 2025. The challenge aims to detect whether essays are machine-generated or human-authored for academic purposes. The task involves identifying whether an essay is generated by a machine or authored by a human. The challenge is conducted in two languages: English and Arabic. A total of 25 teams submitted systems for English, while 21 teams participated in the Arabic track. Additionally, seven teams submitted system description papers. The majority of submissions utilized fine-tuned transformer-based models, with one team employing Large Language Models (LLMs) such as Llama 2 and Llama 3. This paper outlines the task formulation, dataset construction process, evaluation framework, and summarizes approaches adopted by participating teams.
Low GrooveSquid.com (original content) Low Difficulty Summary
The challenge is about detecting whether an essay was written by a machine or a person for academic purposes. The challenge involves two languages: English and Arabic. Many teams took part in this challenge and used special models to analyze the essays. Most of these models were based on transformers, which are a type of AI model. One team even used super powerful language models called Llama 2 and Llama 3. This paper tells us about the challenge, how it was set up, and what different teams did.

Keywords

» Artificial intelligence  » Llama  » Transformer