Monday, October 16, 2023

Researchers find that a modest amount of fine-tuning can bypass safety efforts aiming to prevent LLMs such as OpenAI's GPT-3.5 Turbo from spewing toxic content (Thomas Claburn/The Register)

Thomas Claburn / The Register:

Researchers find that a modest amount of fine-tuning can bypass safety efforts aiming to prevent LLMs such as OpenAI's GPT-3.5 Turbo from spewing toxic content  —  OpenAI GPT-3.5 Turbo chatbot defenses dissolve with ‘20 cents’ of API tickling  —  The “guardrails” created to prevent large language models …


http://dlvr.it/SxVNKR

শেয়ার করুন

Author:

Etiam at libero iaculis, mollis justo non, blandit augue. Vestibulum sit amet sodales est, a lacinia ex. Suspendisse vel enim sagittis, volutpat sem eget, condimentum sem.

0 coment rios: