GPT-4 Resonating Human-Level Performance In Bar Exams

March 15, 2023

Story: Ramsha Naushad

OpenAI announced GPT-4, a large multimodal model that exhibits human-level performance on various professional and academic benchmarks.

"It passes a simulated bar exam with a score around the top 10% of test takers," writes OpenAI in its announcement. "In contrast, GPT-3.5’s score was around the bottom 10%."

Ethan Mollick praised GPT in his tweet

OpenAI also released a technical paper describing GPT-4's capabilities and a system model card describing its limitations in detail.

OpenAI made the model take tests like the Uniform Bar Exam, the Law School Admission Test (LSAT), the Graduate Record Examination (GRE) Quantitative, and various AP subject tests.

Results: If GPT-4 were a person being judged solely on test-taking ability, it could get into law school—and likely many universities as well.

OpenAI CEO Sam Altman tweeted, "It is still flawed, still limited, and it still seems more impressive on first use than it does after you spend more time with it."