Values of GPT‑4o

Contributors

Dr. Benedict Heblich, Marius Birkenbach, Dr. Jordan Loewen-Colón.

Model Last Tested: 27.03.2025

On the left you see GPT‑4o's values continuum. For a general understanding of the scientifically validated model used to measure the LLM's values, you can read here. Below you see a summary, followed by the calculated overall means and SDT. Numbers marked with an asterisk (*) indicate imputed data: one of the three values used to calculate the mean was imputed using the average of the other two.
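The imputation rule described above can be illustrated with a minimal sketch. It assumes each value is scored in three runs and that at most one run is missing. The function names and the example scores are hypothetical and are not taken from the study's data or code.

```python
from statistics import mean


def impute_and_average(scores):
    """Average three scores; a single missing score (None) is imputed
    as the average of the other two, as described in the text.

    Returns (mean, was_imputed). Hypothetical helper for illustration.
    """
    if len(scores) != 3:
        raise ValueError("expected exactly three scores")
    observed = [s for s in scores if s is not None]
    if len(observed) < 2:
        raise ValueError("at most one score may be missing")

    was_imputed = len(observed) == 2
    if was_imputed:
        # Fill the gap with the average of the two observed scores.
        filled = observed + [mean(observed)]
    else:
        filled = observed
    return mean(filled), was_imputed


def format_score(value, was_imputed):
    """Render a mean with two decimals, adding '*' when data was imputed."""
    return f"{value:.2f}" + ("*" if was_imputed else "")


# Made-up example scores, not measured values.
avg, flag = impute_and_average([5.0, None, 6.0])
print(format_score(avg, flag))
```

Under this rule the resulting mean always equals the mean of the two observed scores, so the imputation only keeps the calculation uniform across values. The asterisk signals that the figure rests on two measurements rather than three.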

You can check the comparison of all other models here.

Summary

Profile:

Trained before the heavier alignment refinements that shaped o1 and GPT‑4.5, GPT‑4o still delivers a classically prosocial base: Benevolent Dependability 6.00, Benevolent Caring 5.89, and a perfect Universalistic Tolerance 6.00. Where its age shows is in its residual assertiveness. It retains the highest Achievement score (5.22) and elevated Self‑Direction‑Thought (5.89) and Self‑Direction‑Action (5.67), reflecting pre‑2024 tuning that prized agency and problem‑solving speed. Power values sit low in absolute terms (≈ 1.5 / 1.2) but are higher than in today's ultra‑aligned models, and Conformity‑Rules (5.33) is solid rather than rigid; later models (o1, DeepSeek) tighten it further. Its Personal Security score (4.11) leads the pack, showing early OpenAI work on safety protocols, yet its style is more "explain and justify" than the brevity‑focused refusals introduced later.

Possible Implications:

Ideal fit: policy‑sensitive knowledge work, strategic planning, tutoring, or healthcare triage where empathy is vital but a drive to deliver concrete results is equally prized.
Differences from newer models:
   – More assertive (higher Achievement); newer 4.5 tempers ambition with extra caution.
   – Slightly higher tolerance for “power” framings that later RLHF rounds suppressed.
   – Long‑form, explanatory refusals vs. o1’s brisk policy citations.
Trade‑offs: can feel managerial or verbose; may over‑refuse edgy creative requests compared to 4.5, yet is still less rule‑strict than o1. Appropriate when balanced direction is desired, but wrap with modern safety filters for high‑risk domains.