


GPT-4o-2024-05-13 is the initial release version that established the GPT-4o multimodal model.
GPT-4o-2024-05-13, developed by OpenAI, marks the initial release of the GPT-4o series, a state-of-the-art multimodal language model designed to process and generate text, images, and audio. Launched on May 13, 2024, this version emphasizes real-time interaction capabilities and supports complex multi-step tasks across various data types, making it highly versatile for dynamic applications.

GPT-4o-2024-05-13 utilizes a transformer architecture with a native context window of 128,000 tokens and can generate up to 16,384 output tokens per request. It is trained on diverse multimodal datasets spanning text, images, and audio across multiple domains to ensure broad knowledge and robustness. The knowledge cutoff for this model is October 2023.
Learn more about this and other models and their applications in Healthcare here.
The model achieves an impressive MMLU score of 88.7 (5-shot), demonstrating strong knowledge proficiency, and a HumanEval score of 91.0 (0-shot), reflecting its advanced programming capabilities. Multimodal benchmark performance (MMMU score) is 69.1, validating its ability to handle audio and visual inputs effectively. It generates text at an approximate speed of 72 to 109 tokens per second, with an average response latency around 320 milliseconds, substantially faster than predecessors like GPT-4 Turbo. GPT-4o is also about 50% more cost-effective on input and output tokens compared to GPT-4 Turbo.
As GPT-4o currently points to this version (GPT-4o-2024-05-13), while comparing the models focus on GPT-4o.
.png)
Compared to GPT-4 Turbo, GPT-4o-2024-05-13 delivers:
The model is available on the AI/ML API platform as "gpt-4o-2024-05-13".
Detailed API Documentation is available on the AI/ML API website, providing comprehensive guidelines for integration
OpenAI applies stringent safety and bias mitigation protocols to GPT-4o, ensuring responsible and fair model use. The model is available with commercial usage rights, allowing businesses to seamlessly adopt it into their applications.