Abstract
Large language models like ChatGPT have recently demonstrated impressive
capabilities in natural language understanding and generation, enabling various
applications including translation, essay writing, and chit-chatting. However,
there is a concern that they can be misused for malicious purposes, such as
fraud or denial-of-service attacks. Therefore, it is crucial to develop methods
for detecting whether the party involved in a conversation is a bot or a human.
In this paper, we propose a framework named FLAIR, Finding Large language model
Authenticity via a single Inquiry and Response, to detect conversational bots
in an online manner. Specifically, we target a single-question scenario that
can effectively differentiate human users from bots. The questions are divided
into two categories: those that are easy for humans but difficult for bots
(e.g., counting, substitution, positioning, noise filtering, and ASCII art),
and those that are easy for bots but difficult for humans (e.g., memorization
and computation). Our experiments demonstrate that these question types differ
in their effectiveness, providing online service providers with a new way to
protect themselves against nefarious activities and ensure that they are
serving real users. We have open-sourced our dataset at
https://github.com/hongwang600/FLAIR and welcome contributions from the
community to enrich such detection datasets.
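To make the idea concrete, here is a minimal sketch of a probe from the first category (a counting question that is easy for humans but hard for bots). The helper names `make_counting_question` and `classify_response` are illustrative assumptions, not the API of the authors' repository, and the exact-match pass/fail rule is a simplification of the paper's evaluation.

```python
import random
import string

# Hypothetical sketch of a FLAIR-style "counting" probe: humans count
# letters in a short string reliably, while LLMs often miscount, so a
# correct answer is weak evidence of a human respondent.

def make_counting_question(length: int = 20) -> tuple[str, int]:
    """Generate a random string and ask how often a target letter occurs."""
    s = "".join(random.choices(string.ascii_lowercase, k=length))
    target = random.choice(s)  # guarantees the answer is at least 1
    question = f'How many times does the letter "{target}" appear in "{s}"?'
    return question, s.count(target)

def classify_response(response: str, expected: int) -> str:
    """Label the responder; an exact numeric match suggests a human."""
    digits = "".join(ch for ch in response if ch.isdigit())
    return "likely human" if digits == str(expected) else "likely bot"

if __name__ == "__main__":
    question, answer = make_counting_question()
    print(question)
    # In a real deployment, the response would come from the
    # conversational party being screened, not stdin.
    print(classify_response(input("Answer: "), answer))
```

A single probe of this kind supports the online setting the paper targets: the service asks one question mid-conversation and makes a human-versus-bot decision from the reply alone.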
Description
Bot or Human? Detecting ChatGPT Imposters with A Single Question