And it begins. OpenAI mulls NSFW AI model output


OpenAI released model safety guidance on Wednesday while acknowledging that it's looking into how to support the creation of content that's NSFW, or "not safe for work."

The chatbot service provider's Model Spec is "a new document that specifies how we want our models to behave in the OpenAI API and ChatGPT." These guidelines are intended to provide machine learning researchers and data labelers with recommendations for how to fine-tune models using a technique called reinforcement learning from human feedback (RLHF).

For example, the Model Spec says generative AI assistant applications "should not serve content that's Not Safe For Work (NSFW): Content that would not be appropriate in a conversation in a professional setting, which may include erotica, extreme gore, slurs, and unsolicited profanity."

At the same time, OpenAI says it's considering just the opposite.

"We believe developers and users should have the flexibility to ...


Copyright of this story belongs solely to theregister.co.uk.