Tag
An increasing number of users are shifting from heavily aligned cloud LLMs like ChatGPT, Claude, and Gemini to local or uncensored alternatives due to frequent refusals, privacy concerns, and desire for more control, though cloud models retain advantages in speed and ease of use.
OpenAI introduces InstructGPT, a GPT-3 variant fine-tuned using reinforcement learning from human feedback (RLHF) to better follow instructions and reduce harmful outputs. A 1.3B InstructGPT model is preferred by human evaluators over a 175B GPT-3 model, now becoming the default on OpenAI's API.