DEVELOPER

New OpenAI model GPT-4o Mini retires GPT-3.5

19/07/2024

As the basis of ChatGPT, GPT-3.5 is probably the most influential major language model of all. Now it will have to give way to a successor, which OpenAI has now presented. GPT-4o mini is the smaller version of GPT-4o, which OpenAI released in May. Like its big brother, it is designed as a multimodal model, but at the moment it is limited in this regard. It can now process graphical inputs via APIs. The future will produce image, video and audio outputs.

A multimodal model with lots of performance for the money

GPT-4o Mini is trained on data extended to October 2023. Although the context window is about eight times larger than GPT-3.5 Turbo with 128k tokens, it is still smaller than Anthropic’s smallest model, Cloud 3 Haiku. However, it can generate 16,000 output tokens, which is far more than most comparable models. Even Cloud Sonnet, which competes in the next size category, only achieves half of that. In terms of output speed, GPT-4o Mini also leads the most popular LLM with 166 tokens per second.

As expected, in the benchmarks published by OpenAI it also tops smaller models like Google’s Haiku or Gemini Flash, but the gap is not particularly large. OpenAI could really score points with its price-performance ratio, at least for the moment. Both Anthropic and Google have based themselves on GPT-3.5 price expectations and are therefore currently significantly more expensive.

In general AI benchmarks, GPT-4o currently ranks at the top among “small” large language models. But the distances are not particularly large.

(Image: OpenAI)

GPT-4o Mini charges 15 cents per million input tokens and 60 cents per million output tokens. This is about 60 percent less than its predecessor. For comparison: for the larger GPT-4.o, OpenAI charges $5 per million inputs and $15 per million output tokens. That is more than thirty times more.

According to OpenAI, GTP-4o is the company’s first AI model to use a mini instruction hierarchy. This technique lets the model prioritize certain instructions over others. It is intended to make prompt injection attacks, jailbreaks, or system prompt extraction more difficult for users who bypass the built-in changes or instructions provided by the system prompt.

(ulw)

New OpenAI model GPT-4o Mini retires GPT-3.5

A multimodal model with lots of performance for the money

LEAVE A REPLY Cancel reply

EDITOR PICKS

LKA warns false ETA application pages for admission to Great Britain

Change of the era: iPhone only big, with Minale 128 GB and Worth Lightning

Product Staff: Don’t be a lone warrior, reach out and get help!

POPULAR POSTS

With iOS 18: iPhone can restore iPhone 16

WhatsApp is higher than the threshold-stricter regulation of the European Union

In the first test: Xiaomi 15 Ultra with Leica Camera on MWC 2025

POPULAR CATEGORY

ABOUT US

FOLLOW US

With INCron or Inotify: Linux automatically digitize paper template text