Entries by Philip Conrod

OpenAI Releases GPT-5 and it is State of the Art (SOTA) across Key Coding Benchmarks

OpenAI released GPT‑5 today and it is now state-of-the-art (SOTA) across key coding benchmarks, scoring 74.9% on SWE-bench Verified and 88% on Aider polyglot. SWE-bench Verified: (Tests AI models on real-world GitHub issues, evaluating their ability to generate accurate code patches) GPT-5 with Thinking (High) scored highest with 74.9%, followed closely by Claude Opus 4.1 […]

Using OpenAI GPT-OSS Open Weight Local LLM Model to Develop a Moon Landing Simulation Using C# on my Alienware Aurora R11 RTX-3080 10GB Video Card

OpenAI released their new Open Weights local LLM model today under the Apache License that you can download and run it locally on your own hardware without the need of a cloud subscription.  The larger 120 billion parameter model will require a system with an 80GB GPU (who has that!?) or a MAC M3/M4 system […]

Ludus AI Agent Assisted Game Development Tools for Unreal Game Engine

In preparation for my presentation for my July 2025 CGDC presentation on AI Assisted Game Development Tools, I spent a lot of time beta testing the latest beta version of the Ludus AI Blueprints Toolkit for the Unreal Game Engine using Unreal 5.6. I did my Blueprints testing using their 14 Day free Pro Subscription […]

Local LLM Moon Landing Simulation C# Co-Development Performance Results using my Alienware Aurora R11 RTX-3080 10GB using LM Studio

For the past several months, I have been co-developing a Moon Landing simulation game using several different open source LLM models running “locally” on my Alienware Aurora 11 GeForce RTX-3080 10GB VRAM video card.   Running LLMs locally using my RTX 3080 is completely free compared to using cloud based LLMs. In my last blog post […]

OpenAI o3-mini Reasoning Model Released for AI Assisted Coding

OpenAI has released o3-mini  (“o” for “Omni, not zero”) which is a step up in performance from o1-mini.  I have been reviewing OpenAI’s performance charts their new o3 reasoning model and it is scoring pretty well on CodeForces, Software Engineering (SWE-bench verified), and on LiveBench coding.  We will continue testing their o1 & o3 reasoning […]

Entering The New Era of AI Assisted Programming with OpenAI GPT o1 “Strawberry” and Microsoft Co-Pilot

OpenAI has released a version of GPT-4o (“o” for “Omni”), o1 Preview (code named Strawberry) which could revolutionize the way we develop code.  It is much better at computer programming (aka coding) than any of their previous AI models.   Looking at OpenAI’s performance charts the new o1 reasoning model is scoring pretty well on […]