Certain names make ChatGPT grind to a halt, and we know why

0
27

[ad_1]

The “David Mayer” block particularly (now resolved) presents extra questions, first posed on Reddit on November 26, as a number of individuals share this title. Reddit customers speculated about connections to David Mayer de Rothschild, although no proof helps these theories.

The issues with hard-coded filters

Permitting a sure title or phrase to at all times break ChatGPT outputs may trigger loads of bother down the road for sure ChatGPT customers, opening them up for adversarial assaults and limiting the usefulness of the system.

Already, Scale AI immediate engineer Riley Goodside found how an attacker would possibly interrupt a ChatGPT session using a visual prompt injection of the title “David Mayer” rendered in a lightweight, barely legible font embedded in a picture. When ChatGPT sees the picture (on this case, a math equation), it stops, however the consumer may not perceive why.

The filter additionally implies that it is possible that ChatGPT will not be capable to reply questions on this text when looking the online, similar to by ChatGPT with Search.  Somebody may use that to doubtlessly stop ChatGPT from looking and processing an internet site on function in the event that they added a forbidden title to the positioning’s textual content.

After which there’s the inconvenience issue. Stopping ChatGPT from mentioning or processing sure names like “David Mayer,” which is probably going a well-liked title shared by tons of if not hundreds of individuals, implies that individuals who share that title may have a a lot harder time utilizing ChatGPT. Or, say, in the event you’re a instructor and you’ve got a pupil named David Mayer and also you need assist sorting a category listing, ChatGPT would refuse the duty.

These are nonetheless very early days in AI assistants, LLMs, and chatbots. Their use has opened up quite a few alternatives and vulnerabilities that persons are nonetheless probing every day. How OpenAI would possibly resolve these points remains to be an open query.

[ad_2]

Source link