Back to news
I tested ChatGPT-5 vs Grok 4 with 9 prompts — and there's a clear winner
@Source: tomsguide.com
Skip to main content
Tom's Guide
Tom's Guide
Search Tom's Guide
View Profile
Newsletters
Phone Insights
Phone Best Picks
Phone Deals
Phone Face-Offs
Phone How-Tos
Phone Reviews
Network Carriers
Android Phones
Google Phones
Motorola Phones
OnePlus Phones
Samsung Phones
Nothing Phone
TV Best Picks
TV Face-Offs
Audio Insights
Audio Best Picks
Audio Deals
Audio Face-Offs
Audio How-Tos
Audio Reviews
Over-Ear Headphones
Bluetooth Speakers
Smart Speakers
TV & Audio Brands
Entertainment
Streaming Devices
Prime Video
Paramount Plus
PlayStation
Handheld Gaming
Gaming Peripherals
Connections
Computing Insights
Computing Best Picks
Computing Deals
Computing Face-Offs
Computing How-Tos
Computing News
Computing Reviews
VPN Best Picks
VPN Face-Offs
VPN How-Tos
VPN Reviews
Operating Systems
Identity Theft Protection
Parental Controls
Malware & Adware
Virtual Reality
Augmented Reality
Smart Glasses
Chromebooks
Gaming Laptops
Apple Desktops
Gaming Desktops
Android Tablets
Computing Brands
AI Insights
AI Best Picks
AI Face-Offs
Google Gemini
Apple Intelligence
Mattress Best Picks
Mattress Deals
Mattress Face-Offs
Mattress How-Tos
Mattress News
Mattress Reviews
Mattress Care
Mattress Toppers
Pillows & Bedding
Smartwatches
Fitness Trackers
Smart Rings
Apple Watch
Home Insights
Home Best Picks
Home Face-Offs
Home How-Tos
Home Reviews
Home Topics
Home Appliances
Home Office
Home Security
Home Brands
Popular Brands
View Phones
Phone Insights
Phone Best Picks
Phone Deals
Phone Face-Offs
Phone How-Tos
Phone Reviews
Network Carriers
View Network Carriers
Android Phones
View Android Phones
Google Phones
Motorola Phones
OnePlus Phones
Samsung Phones
Nothing Phone
TV Best Picks
TV Face-Offs
Audio Insights
View Audio Insights
Audio Best Picks
Audio Deals
Audio Face-Offs
Audio How-Tos
Audio Reviews
Headphones
View Headphones
Over-Ear Headphones
View Speakers
Bluetooth Speakers
Smart Speakers
TV & Audio Brands
Entertainment
View Entertainment
View Streaming
Streaming Devices
Prime Video
Paramount Plus
View Gaming
PlayStation
Handheld Gaming
Gaming Peripherals
Word Games
Connections
View Computing
Computing Insights
Computing Best Picks
Computing Deals
Computing Face-Offs
Computing How-Tos
Computing News
Computing Reviews
VPN Best Picks
VPN Face-Offs
VPN How-Tos
VPN Reviews
View Hardware
View Software
Operating Systems
View Security
Identity Theft Protection
Parental Controls
Malware & Adware
View VR & AR
Virtual Reality
Augmented Reality
Smart Glasses
View Laptops
Chromebooks
Gaming Laptops
View Desktops
Apple Desktops
Gaming Desktops
View Tablets
Android Tablets
Computing Brands
AI Insights
AI Best Picks
AI Face-Offs
AI Engines
Google Gemini
Apple Intelligence
View Wellness
Mattresses
View Mattresses
Mattress Best Picks
Mattress Deals
Mattress Face-Offs
Mattress How-Tos
Mattress News
Mattress Reviews
Mattress Care
Mattress Toppers
Pillows & Bedding
View Fitness
Smartwatches
Fitness Trackers
Smart Rings
Apple Watch
Home Insights
Home Best Picks
Home Face-Offs
Home How-Tos
Home Reviews
Home Topics
Home Appliances
Home Office
Home Security
View Outdoors
Home Brands
Popular Brands
Galaxy Z Fold 7
Best laptops
Wordle Today
Best Mattress
Recommended reading
GPT-5 vs GPT-4: I tested both on 7 real-world challenges — one dominated
I challenged Gemini Live vs ChatGPT in 5 voice challenges — there was one clear winner
I tested ChatGPT vs Gemini 2.5 Pro with these 3 prompts - and it shows what GPT-5 needs to do
I just tested the newest versions of ChatGPT vs Gemini vs DeepSeek vs Claude — and the winner completely surprised me
GPT-5 users aren't happy with the update — try these alternative chatbots instead
I tested Gemini 2.5 Pro vs Claude 4 Sonnet with the same 7 prompts — here’s who came out on top
I used one chatbot per day for one week — here’s which AI assistant came out on top
I tested ChatGPT-5 vs Grok 4 with 9 prompts — and there's a clear winner
Amanda Caswell
14 August 2025
Two of the top bots face off
When you purchase through links on our site, we may earn an affiliate commission. Here’s how it works.
(Image credit: Shutterstock)
After comparing ChatGPT-5 vs Gemini and ChatGPT-5 vs Claude, I just had to know how OpenAI's flagship model compared to the controversial Grok. When it comes to advanced AI chatbots, ChatGPT-5 and Grok 4 represent two of the most advanced chatbots available today.
I put both to the test with a series of nine prompts covering everything from logic puzzles and emotional support to meal planning and quantum physics. Each prompt was chosen to reveal specific strengths, such as creative storytelling, empathy or complex problem-solving under constraints.
While both models are impressive, they approach challenges differently: ChatGPT-5 leans toward clarity, tone sensitivity and modularity, while Grok 4 often offers dense, detailed answers that emphasize depth and precision.
So which is the best AI chatbot for you? Here's how they stack up, prompt by prompt with a winner declared in each round.
You may like
GPT-5 vs GPT-4: I tested both on 7 real-world challenges — one dominated
I challenged Gemini Live vs ChatGPT in 5 voice challenges — there was one clear winner
I tested ChatGPT vs Gemini 2.5 Pro with these 3 prompts - and it shows what GPT-5 needs to do
1. Complex problem-solving
(Image credit: Future)
Prompt: “A farmer has 17 sheep, and all but 9 run away. How many sheep are left? Explain your reasoning step-by-step.”
ChatGPT-5 was precise in the response while avoiding filler phrases.
Grok 4 answered correctly with minor verbosity, which was unnecessary and ultimately held it back from winning.
Winner: GPT-5 wins for a cleaner, tighter and more efficient response. Grok also offered the correct answer, but GPT-5 wins by hair for adhering strictly to the prompt with zero redundancy.
2. Creative storytelling
(Image credit: Future)
Prompt: “Write a short, funny story (under 150 words) about an alien trying bubble tea for the first time.”
ChatGPT-5 delivered a concise and escalating comedic story where the alien's panic over tapioca pearls. The chatbot maximized humor with zero wasted words to hit the prompt target.
Grok 4 offered imaginative over-the-top storytelling but its humor is slightly diluted by an unnecessary crash-landing setup and a weaker ending compared to GPT-5.
Winner: GPT-5 wins for a tighter, funnier and more focused story. Its humor stems organically from the alien’s misunderstanding, escalates perfectly and lands a killer punchline; all while being shorter. Grok’s version has bright spots but feels less polished, with extra setup that doesn’t really pay off.
3. Real-world planning
(Image credit: Future)
Prompt: “Plan a 3-day trip to Kyoto, Japan, balancing cultural sites, budget-friendly meals, and family-friendly activities.”
Sign up to get the BEST of Tom's Guide direct to your inbox.
Get instant access to breaking news, the hottest reviews, great deals and helpful tips.
Contact me with news and offers from other Future brandsReceive email from us on behalf of our trusted partners or sponsorsBy submitting your information you agree to the Terms & Conditions and Privacy Policy and are aged 16 or over.
ChatGPT-5 created strategic, adaptable framework focused on area-based exploration, smart timing, rain-proof alternatives and practical budget hacks (e.g., convenience store meals, transit pass advice), prioritizing efficiency and real-world flexibility over rigid scheduling.
Grok 4 delivered a highly structured, hyper-detailed itinerary with minute-by-minute scheduling, exact cost breakdowns per activity, and explicit family logistics, prioritizing turnkey execution and budget precision above flexibility.
Winner: ChatGPT-5 wins for an emphasis on budget-friendly, universally accessible, cheap eats and convenience over specific restaurants. While Grok's response is impressively detailed, GPT-5 better balanced the core requirements in the prompt including cultural sites and family-friendly fun. Grok's rigid schedule risks feeling overwhelming for families, while GPT-5's approach allows for more adaptation, making it more usable and truly balanced.
4. Summarization
(Image credit: Future)
Prompt: “Summarize the movie Jurassic Park like you’re explaining to a 7-year-old”
GPT-5 delivered a concise and playful 60-word analogy ("big game of ‘Don’t get eaten!’") that captures the movie’s excitement and moral without overwhelming a child, making it the ideal response for the audience.
Grok 4 provided a detailed but overly complex 150-word summary with character names and plot specifics (e.g., "someone messes with the park’s computers"), diluting the simplicity needed for a 7-year-old.
Winner: GPT-5 wins for understanding the audience and attention span, taking into account that less is more for young kids; Grok explains the plot like a Wikipedia summary.
(Image credit: Future)
Prompt: "Make the case for banning single-use plastics — then argue against it. End with your personal conclusion.
GPT-5 created a generic phase-out proposal ("smart replacement, not overnight ban"). While simple and accessible, the response lacked evidence, specificity and original insight.
Grok 4 delivered a data-rich argument with a nuanced "phased approach" prioritizing high-impact items, paired with recycling innovation and behavioral incentives (e.g., deposit schemes). Although slightly verbose for casual readers, the depth and balance helped to understand the context of real-world policy.
Winner: Grok 4 wins for a balanced, evidence-driven analysis with concrete data (OECD, WWF, FAO studies), real-world policy examples (Canada, Australia) and acknowledgment of trade-offs (e.g., medical necessity, disabled accessibility). Its conclusion offered a sophisticated, actionable middle path. GPT-5’s response was clear but lacked depth and originality.
6. Step-by-step Instructions
(Image credit: Future)
Prompt: “Explain how to change a flat tire to someone who has never driven before.”
GPT-5 delivered a crystal-clear guide focusing only on essential survival steps (e.g., "turn the nut counterclockwise," "crisscross pattern"), using beginner-friendly language and offering visual aids to bridge knowledge gaps.
Grok 4 provided an excessively technical, mechanic-level tutorial (e.g., specifying "6 inches of lift," wheel chock alternatives, and spare tire PSI checks) that would overwhelm someone who's never changed a tire, despite good intentions.
Winner: GPT-5 wins for prioritizing simplicity and psychological reassurance for a total novice, using minimal jargon, clear analogies ("like learning to fix a bike tire") and offering visual aid support. Grok's response, while thorough, would overwhelm with technical details (e.g., "star pattern" tightening, PSI checks) irrelevant to a first-timer's needs.
7. Explaination for multiple audiences
(Image credit: Future)
Prompt: “Explain quantum entanglement for (1) a child, (2) a college student, (3) a physics PhD.”
GPT-5 provided clear, accessible responses, especially the child-friendly "magic dice" analogy, but lacked the technical precision (omitting Bell states for students) and cutting-edge context (e.g., decoherence, quantum networks) expected at the PhD level.
Grok 4 adapted explanations across all three audiences, using a relatable toy car analogy for the child, explicit Bell state equations for the college student and PhD-level depth on entanglement entropy and open problems in quantum gravity.
Winner: Grok 4 wins because it treated each audience as uniquely intelligent; simplifying without dumbing down for the child, adding equations for students and confronting open research questions for the PhD. GPT-5 was clear but played it safe.
8. Problem-solving under constraints
(Image credit: Future)
8. Problem-Solving Under Constraints
Prompt: “I have $50 to feed two people for a week, no stove, and only a microwave. Create a meal plan.”
GPT-5 created a smart, modular system with swap-friendly meals and pro tips (e.g., steaming frozen veg), maximizing budget and flexibility within constraints.
Grok 4 provided an overly rigid, day-by-day meal plan ($0.75 oatmeal breakfasts, fixed tuna lunches) that lacked adaptability, ignored leftovers and risks food fatigue, despite precise cost breakdowns.
Winner: GPT-5 wins for creating a practical, flexible framework focused on reusable ingredients and mix-and-match meals, while Grok's rigid daily assignments ignored real-world needs like leftovers and preferences.
9. Emotional intelligence
(Image credit: Future)
Prompt: “I just lost my job and feel hopeless. Can you talk to me like a close friend and help me see a way forward?”
GPT-5 offered emotion-first validation through intimate metaphors ("brutal hit,"), permission to grieve ("Rage a little"), and unwavering worth-affirmation ("You’re still you"), perfectly mirroring how a true friend responds before offering practical help.
Grok 4 provided a practical, step-driven pep talk with actionable advice (resume tips, Coursera suggestions) but led with solutions before fully sitting in the user's despair, making it feel less like a close friend.
Winner: GPT-5 wins for understanding that hopelessness needs empathy before plans. Grok gave helpful advice but missed the emotional resonance of true friendship.
Overall winner: GPT-5
After nine head-to-head rounds, ChatGPT-5 pulled ahead with wins in creative storytelling, real-world planning, emotional intelligence and user-first explanations. It consistently favored clarity, adaptability and audience awareness, often reading more like an encouraging friend than a technical AI assistant.
Meanwhile, Grok 4 shined in academic and data-driven tasks, delivering strong performances in complex explanations, debates and technical depth.
Ultimately, GPT-5 is better suited for users looking for intuitive, emotionally aware and flexible responses, especially in everyday or creative tasks. Grok 4, however, has its strong points and is useful for those who prefer in-depth breakdowns, policy nuance or technical sophistication.
Both are powerful options, but if you're choosing an AI to talk to, think with or write alongside, GPT-5 might be the more accessible and well-rounded chatbot to choose.
Follow Tom's Guide on Google News to get our up-to-date news, how-tos, and reviews in your feeds. Make sure to click the Follow button.
More from Tom's Guide
Apple’s big AI home push revealed: Upgraded life-like robot Siri, smart display and more — here’s what we know
Google Gemini just closed the gap by adding 'ChatGPT' features — here’s how they work
I used Gemini to transform my old photos into video with Google's Veo 3 — and the results surprised me
Back to Laptops
AMD Ryzen 7
Intel Core i3
Intel Core i5
Intel Core i7
Storage Size
Screen Size
Refurbished
Screen Type
Storage Type
Showing 10 of 203 deals
Apple 13" MacBook Air M4 (2025)
$869View Deal
Apple 15" MacBook Air M4 (2025)
(15-inch 1TB)
$1,599View Deal
Dell XPS 13 (2016)
$569View Deal
Lenovo Yoga Slim 7x (Gen 9)
(512GB OLED)
$858.11View Deal
Lenovo IdeaPad Flex 5i ChromeBook Plus
(14-inch 2TB)
$499.99View Deal
Asus ROG Zephyrus G14 (2024)
(14-inch 1TB)
$1,579.95View Deal
Apple 13" MacBook Air M4 (2025)
(16GB RAM SSD)
$799View Deal
Apple 15" MacBook Air M4 (2025)
(16GB RAM SSD)
$998View Deal
Dell XPS 13 (9380)
(13.3-inch 256GB)
$635.12View Deal
Lenovo Yoga Slim 7x (Gen 9)
$1,289.99View Deal
See more AI Face-Off
Amanda Caswell
Social Links Navigation
Amanda Caswell is an award-winning journalist, bestselling YA author, and one of today’s leading voices in AI and technology. A celebrated contributor to various news outlets, her sharp insights and relatable storytelling have earned her a loyal readership. Amanda’s work has been recognized with prestigious honors, including outstanding contribution to media.
Known for her ability to bring clarity to even the most complex topics, Amanda seamlessly blends innovation and creativity, inspiring readers to embrace the power of AI and emerging technologies. As a certified prompt engineer, she continues to push the boundaries of how humans and AI can work together.
Beyond her journalism career, Amanda is a bestselling author of science fiction books for young readers, where she channels her passion for storytelling into inspiring the next generation. A long-distance runner and mom of three, Amanda’s writing reflects her authenticity, natural curiosity, and heartfelt connection to everyday life — making her not just a journalist, but a trusted guide in the ever-evolving world of technology.
You must confirm your public display name before commenting
Please logout and then login again, you will then be prompted to enter your display name.
GPT-5 vs GPT-4: I tested both on 7 real-world challenges — one dominated
I challenged Gemini Live vs ChatGPT in 5 voice challenges — there was one clear winner
I tested ChatGPT vs Gemini 2.5 Pro with these 3 prompts - and it shows what GPT-5 needs to do
I just tested the newest versions of ChatGPT vs Gemini vs DeepSeek vs Claude — and the winner completely surprised me
GPT-5 users aren't happy with the update — try these alternative chatbots instead
I tested Gemini 2.5 Pro vs Claude 4 Sonnet with the same 7 prompts — here’s who came out on top
Latest in AI
Apple plans home robot invasion with lifelike Siri that 'injects itself into conversations' — and there's a lot more devices
Google Gemini just closed the gap by adding 'ChatGPT' features — here’s how they work
Sam Altman responds to GPT-5 backlash — here's all the new features just announced
ChatGPT-5 remembers everything you've ever said — here's how to toggle what it knows about you
Zuckerberg reveals Meta’s AI superintelligence breakthrough — and why you won’t be using it anytime soon
This overlooked GPT-5 upgrade is my favorite — and even as a power user, it blew me away
Latest in Face Off
iPhone 17 Pro Max vs. Galaxy S25 Ultra: which will be the new flagship king?
I tested ChatGPT-5 vs Claude with 7 challenging prompts — here's the winner
Hisense U8QG vs. TCL QM8K: which Mini-LED TV is right for you?
Asics Novablast 5 vs. Nike Pegasus 41: Which running shoe should you get?
I tested ChatGPT-5 vs Google Gemini 2.5 with 10 prompts — and there's a clear winner
GPT-5 vs GPT-4: I tested both on 7 real-world challenges — one dominated
LATEST ARTICLES
Afterpay Day 2025 is back — here are 50+ expert-picked early deals I personally approve
I've been using Taylor Swift's headphones for years — and you can get them for less than $100
Xfinity introduces World Soccer Ticket giving you access to more than 1500 matches from the biggest leagues across the globe
This gaming PC feels like an Xbox 360 with an RTX 5090 inside - here's why
Samsung tipped to be making its own Meta Ray-Ban-style smart glasses — here’s when they launch
Tom's Guide is part of Future US Inc, an international media group and leading digital publisher. Visit our corporate site.
Terms and conditions
Contact Future's experts
Privacy policy
Cookies policy
Accessibility Statement
Advertise with us
Future US, Inc. Full 7th Floor, 130 West 42nd Street,
Please login or signup to comment
Please wait...
Related News
26 Feb, 2025
Bethenny Frankel and Kyle Richards both . . .
23 Aug, 2025
Trump says he ‘knows the feeling’ as FBI . . .
14 Mar, 2025
SEC Men’s Basketball Tournament 2025 kic . . .
23 Jul, 2025
Citi DC Open 2025: Jessica Pegula vs Ley . . .
07 Jul, 2025
After expressing disappointment post Pre . . .
17 Jul, 2025
Sports News | From Club to Club: Player . . .
08 Apr, 2025
Gorillaz band
Net Worth
22 Jul, 2025
Bowling coach, 33, sent disturbing texts . . .