IMGTB: A Framework for Machine-Generated Text Detection Benchmarking

A paper titled "IMGTB: A Framework for Machine-Generated Text Detection Benchmarking" by Michal Spiegel and Dominik Macko from KINIT and the Faculty of Informatics at the Masaryk University in Brno presents a novel framework designed to evaluate and benchmark machine-generated text detection (MGTD) methods.

A rise of high-quality text generation through large language models (LLMs) like GPT-3 has made it increasingly difficult to distinguish between machine-generated and human-authored content, leading to concerns over disinformation, plagiarism, and misuse. The authors of this paper introduce IMGTB, a flexible and customizable framework that allows researchers to objectively compare different MGTD techniques, addressing the challenges of evaluating and replicating MGTD methods.

As LLMs produce human-like texts, detecting machine-generated content has become a crucial challenge to prevent the spread of false information, detecting plagiarism, and ensuring that machine-generated content does not degrade future model training. However, benchmarking MGTD methods is complicated due to frequent introduction of new detection techniques and the lack of standardised evaluation pipelines. IMGTB aims to fill this gap by offering a framework that simplifies the comparison of MGTD methods.

The paper highlights several advantages of the IMGTB framework, including:

  1. Objective Comparisons: The framework allows for the comparison of various detection methods.
  2. Integration of New Methods: IMGTB simplifies the process of adding custom MGTD methods, allowing for easier implementation and integration of the latest techniques.
  3. Flexible Dataset Usage: The framework supports multiple dataset formats and allows users to benchmark detection methods on custom datasets.
  4. Configurable Experiments: IMGTB allows users to configure various benchmark settings, such as the classifier used, through a YAML configuration file or command-line interface (CLI), making it user-friendly for both simple and complex experiments.
  5. Automated Analysis and Visualisation: After running experiments, the framework automatically generates charts and metrics to help visualise the performance of different detectors.

IMGTB is compared to previous frameworks like MGTBench (He et al., 2023), which offered some solutions for benchmarking but lacked flexibility and ease of integration. IMGTB overcomes these limitations by offering a modular approach that allows for seamless integration of new methods and datasets. IMGTB also supports multilingual MGTD and fine-tuning, making it more versatile for use in diverse applications.

The paper includes case studies demonstrating the use of IMGTB. For instance, one scenario involves evaluating detection methods on a new dataset of machine-generated texts using the CLI. This simple experiment, which could have required significant manual work, is accomplished quickly through the framework. Another example illustrates a more complex experiment, where multiple detection methods are tested across various datasets with specific configurations, showcasing the framework's flexibility and power.

The IMGTB framework thus offers an efficient and configurable solution for evaluating MGTD methods, addressing the current challenges of benchmarking in this area. By allowing researchers to easily integrate new methods, utilise custom datasets, and perform automated analysis, the framework accelerates the development of better detection techniques. The authors highlight the potential for further enhancements, such as integrating adversarial methods and expanding into other modalities like image or video detection. Overall, IMGTB provides a comprehensive tool that supports researchers in improving machine-generated text detection and mitigating the misuse of LLM-generated content.

Link to Zenodo: https://zenodo.org/records/13630211

Link to ACL Anthology: https://aclanthology.org/2024.acl-demos.17.pdf

Data in GitHub: https://github.com/ICTMCG/Awesome-Machine-Generated-Text