Usfijitimes | I tested ChatGPT-5 vs Grok 4 with 9 prompts

14 Aug, 2025

I tested ChatGPT-5 vs Grok 4 with 9 prompts — and there's a clear winner

@Source: tomsguide.com

Skip to main content Tom's Guide Tom's Guide Search Tom's Guide View Profile Newsletters Phone Insights Phone Best Picks Phone Deals Phone Face-Offs Phone How-Tos Phone Reviews Network Carriers Android Phones Google Phones Motorola Phones OnePlus Phones Samsung Phones Nothing Phone TV Best Picks TV Face-Offs Audio Insights Audio Best Picks Audio Deals Audio Face-Offs Audio How-Tos Audio Reviews Over-Ear Headphones Bluetooth Speakers Smart Speakers TV & Audio Brands Entertainment Streaming Devices Prime Video Paramount Plus PlayStation Handheld Gaming Gaming Peripherals Connections Computing Insights Computing Best Picks Computing Deals Computing Face-Offs Computing How-Tos Computing News Computing Reviews VPN Best Picks VPN Face-Offs VPN How-Tos VPN Reviews Operating Systems Identity Theft Protection Parental Controls Malware & Adware Virtual Reality Augmented Reality Smart Glasses Chromebooks Gaming Laptops Apple Desktops Gaming Desktops Android Tablets Computing Brands AI Insights AI Best Picks AI Face-Offs Google Gemini Apple Intelligence Mattress Best Picks Mattress Deals Mattress Face-Offs Mattress How-Tos Mattress News Mattress Reviews Mattress Care Mattress Toppers Pillows & Bedding Smartwatches Fitness Trackers Smart Rings Apple Watch Home Insights Home Best Picks Home Face-Offs Home How-Tos Home Reviews Home Topics Home Appliances Home Office Home Security Home Brands Popular Brands View Phones Phone Insights Phone Best Picks Phone Deals Phone Face-Offs Phone How-Tos Phone Reviews Network Carriers View Network Carriers Android Phones View Android Phones Google Phones Motorola Phones OnePlus Phones Samsung Phones Nothing Phone TV Best Picks TV Face-Offs Audio Insights View Audio Insights Audio Best Picks Audio Deals Audio Face-Offs Audio How-Tos Audio Reviews Headphones View Headphones Over-Ear Headphones View Speakers Bluetooth Speakers Smart Speakers TV & Audio Brands Entertainment View Entertainment View Streaming Streaming Devices Prime Video Paramount Plus View Gaming PlayStation Handheld Gaming Gaming Peripherals Word Games Connections View Computing Computing Insights Computing Best Picks Computing Deals Computing Face-Offs Computing How-Tos Computing News Computing Reviews VPN Best Picks VPN Face-Offs VPN How-Tos VPN Reviews View Hardware View Software Operating Systems View Security Identity Theft Protection Parental Controls Malware & Adware View VR & AR Virtual Reality Augmented Reality Smart Glasses View Laptops Chromebooks Gaming Laptops View Desktops Apple Desktops Gaming Desktops View Tablets Android Tablets Computing Brands AI Insights AI Best Picks AI Face-Offs AI Engines Google Gemini Apple Intelligence View Wellness Mattresses View Mattresses Mattress Best Picks Mattress Deals Mattress Face-Offs Mattress How-Tos Mattress News Mattress Reviews Mattress Care Mattress Toppers Pillows & Bedding View Fitness Smartwatches Fitness Trackers Smart Rings Apple Watch Home Insights Home Best Picks Home Face-Offs Home How-Tos Home Reviews Home Topics Home Appliances Home Office Home Security View Outdoors Home Brands Popular Brands Galaxy Z Fold 7 Best laptops Wordle Today Best Mattress Recommended reading GPT-5 vs GPT-4: I tested both on 7 real-world challenges — one dominated I challenged Gemini Live vs ChatGPT in 5 voice challenges — there was one clear winner I tested ChatGPT vs Gemini 2.5 Pro with these 3 prompts - and it shows what GPT-5 needs to do I just tested the newest versions of ChatGPT vs Gemini vs DeepSeek vs Claude — and the winner completely surprised me GPT-5 users aren't happy with the update — try these alternative chatbots instead I tested Gemini 2.5 Pro vs Claude 4 Sonnet with the same 7 prompts — here’s who came out on top I used one chatbot per day for one week — here’s which AI assistant came out on top I tested ChatGPT-5 vs Grok 4 with 9 prompts — and there's a clear winner Amanda Caswell 14 August 2025 Two of the top bots face off When you purchase through links on our site, we may earn an affiliate commission. Here’s how it works. (Image credit: Shutterstock) After comparing ChatGPT-5 vs Gemini and ChatGPT-5 vs Claude, I just had to know how OpenAI's flagship model compared to the controversial Grok. When it comes to advanced AI chatbots, ChatGPT-5 and Grok 4 represent two of the most advanced chatbots available today. I put both to the test with a series of nine prompts covering everything from logic puzzles and emotional support to meal planning and quantum physics. Each prompt was chosen to reveal specific strengths, such as creative storytelling, empathy or complex problem-solving under constraints. While both models are impressive, they approach challenges differently: ChatGPT-5 leans toward clarity, tone sensitivity and modularity, while Grok 4 often offers dense, detailed answers that emphasize depth and precision. So which is the best AI chatbot for you? Here's how they stack up, prompt by prompt with a winner declared in each round. You may like GPT-5 vs GPT-4: I tested both on 7 real-world challenges — one dominated I challenged Gemini Live vs ChatGPT in 5 voice challenges — there was one clear winner I tested ChatGPT vs Gemini 2.5 Pro with these 3 prompts - and it shows what GPT-5 needs to do 1. Complex problem-solving (Image credit: Future) Prompt: “A farmer has 17 sheep, and all but 9 run away. How many sheep are left? Explain your reasoning step-by-step.” ChatGPT-5 was precise in the response while avoiding filler phrases. Grok 4 answered correctly with minor verbosity, which was unnecessary and ultimately held it back from winning. Winner: GPT-5 wins for a cleaner, tighter and more efficient response. Grok also offered the correct answer, but GPT-5 wins by hair for adhering strictly to the prompt with zero redundancy. 2. Creative storytelling (Image credit: Future) Prompt: “Write a short, funny story (under 150 words) about an alien trying bubble tea for the first time.” ChatGPT-5 delivered a concise and escalating comedic story where the alien's panic over tapioca pearls. The chatbot maximized humor with zero wasted words to hit the prompt target. Grok 4 offered imaginative over-the-top storytelling but its humor is slightly diluted by an unnecessary crash-landing setup and a weaker ending compared to GPT-5. Winner: GPT-5 wins for a tighter, funnier and more focused story. Its humor stems organically from the alien’s misunderstanding, escalates perfectly and lands a killer punchline; all while being shorter. Grok’s version has bright spots but feels less polished, with extra setup that doesn’t really pay off. 3. Real-world planning (Image credit: Future) Prompt: “Plan a 3-day trip to Kyoto, Japan, balancing cultural sites, budget-friendly meals, and family-friendly activities.” Sign up to get the BEST of Tom's Guide direct to your inbox. Get instant access to breaking news, the hottest reviews, great deals and helpful tips. Contact me with news and offers from other Future brandsReceive email from us on behalf of our trusted partners or sponsorsBy submitting your information you agree to the Terms & Conditions and Privacy Policy and are aged 16 or over. ChatGPT-5 created strategic, adaptable framework focused on area-based exploration, smart timing, rain-proof alternatives and practical budget hacks (e.g., convenience store meals, transit pass advice), prioritizing efficiency and real-world flexibility over rigid scheduling. Grok 4 delivered a highly structured, hyper-detailed itinerary with minute-by-minute scheduling, exact cost breakdowns per activity, and explicit family logistics, prioritizing turnkey execution and budget precision above flexibility. Winner: ChatGPT-5 wins for an emphasis on budget-friendly, universally accessible, cheap eats and convenience over specific restaurants. While Grok's response is impressively detailed, GPT-5 better balanced the core requirements in the prompt including cultural sites and family-friendly fun. Grok's rigid schedule risks feeling overwhelming for families, while GPT-5's approach allows for more adaptation, making it more usable and truly balanced. 4. Summarization (Image credit: Future) Prompt: “Summarize the movie Jurassic Park like you’re explaining to a 7-year-old” GPT-5 delivered a concise and playful 60-word analogy ("big game of ‘Don’t get eaten!’") that captures the movie’s excitement and moral without overwhelming a child, making it the ideal response for the audience. Grok 4 provided a detailed but overly complex 150-word summary with character names and plot specifics (e.g., "someone messes with the park’s computers"), diluting the simplicity needed for a 7-year-old. Winner: GPT-5 wins for understanding the audience and attention span, taking into account that less is more for young kids; Grok explains the plot like a Wikipedia summary. (Image credit: Future) Prompt: "Make the case for banning single-use plastics — then argue against it. End with your personal conclusion. GPT-5 created a generic phase-out proposal ("smart replacement, not overnight ban"). While simple and accessible, the response lacked evidence, specificity and original insight. Grok 4 delivered a data-rich argument with a nuanced "phased approach" prioritizing high-impact items, paired with recycling innovation and behavioral incentives (e.g., deposit schemes). Although slightly verbose for casual readers, the depth and balance helped to understand the context of real-world policy. Winner: Grok 4 wins for a balanced, evidence-driven analysis with concrete data (OECD, WWF, FAO studies), real-world policy examples (Canada, Australia) and acknowledgment of trade-offs (e.g., medical necessity, disabled accessibility). Its conclusion offered a sophisticated, actionable middle path. GPT-5’s response was clear but lacked depth and originality. 6. Step-by-step Instructions (Image credit: Future) Prompt: “Explain how to change a flat tire to someone who has never driven before.” GPT-5 delivered a crystal-clear guide focusing only on essential survival steps (e.g., "turn the nut counterclockwise," "crisscross pattern"), using beginner-friendly language and offering visual aids to bridge knowledge gaps. Grok 4 provided an excessively technical, mechanic-level tutorial (e.g., specifying "6 inches of lift," wheel chock alternatives, and spare tire PSI checks) that would overwhelm someone who's never changed a tire, despite good intentions. Winner: GPT-5 wins for prioritizing simplicity and psychological reassurance for a total novice, using minimal jargon, clear analogies ("like learning to fix a bike tire") and offering visual aid support. Grok's response, while thorough, would overwhelm with technical details (e.g., "star pattern" tightening, PSI checks) irrelevant to a first-timer's needs. 7. Explaination for multiple audiences (Image credit: Future) Prompt: “Explain quantum entanglement for (1) a child, (2) a college student, (3) a physics PhD.” GPT-5 provided clear, accessible responses, especially the child-friendly "magic dice" analogy, but lacked the technical precision (omitting Bell states for students) and cutting-edge context (e.g., decoherence, quantum networks) expected at the PhD level. Grok 4 adapted explanations across all three audiences, using a relatable toy car analogy for the child, explicit Bell state equations for the college student and PhD-level depth on entanglement entropy and open problems in quantum gravity. Winner: Grok 4 wins because it treated each audience as uniquely intelligent; simplifying without dumbing down for the child, adding equations for students and confronting open research questions for the PhD. GPT-5 was clear but played it safe. 8. Problem-solving under constraints (Image credit: Future) 8. Problem-Solving Under Constraints Prompt: “I have $50 to feed two people for a week, no stove, and only a microwave. Create a meal plan.” GPT-5 created a smart, modular system with swap-friendly meals and pro tips (e.g., steaming frozen veg), maximizing budget and flexibility within constraints. Grok 4 provided an overly rigid, day-by-day meal plan ($0.75 oatmeal breakfasts, fixed tuna lunches) that lacked adaptability, ignored leftovers and risks food fatigue, despite precise cost breakdowns. Winner: GPT-5 wins for creating a practical, flexible framework focused on reusable ingredients and mix-and-match meals, while Grok's rigid daily assignments ignored real-world needs like leftovers and preferences. 9. Emotional intelligence (Image credit: Future) Prompt: “I just lost my job and feel hopeless. Can you talk to me like a close friend and help me see a way forward?” GPT-5 offered emotion-first validation through intimate metaphors ("brutal hit,"), permission to grieve ("Rage a little"), and unwavering worth-affirmation ("You’re still you"), perfectly mirroring how a true friend responds before offering practical help. Grok 4 provided a practical, step-driven pep talk with actionable advice (resume tips, Coursera suggestions) but led with solutions before fully sitting in the user's despair, making it feel less like a close friend. Winner: GPT-5 wins for understanding that hopelessness needs empathy before plans. Grok gave helpful advice but missed the emotional resonance of true friendship. Overall winner: GPT-5 After nine head-to-head rounds, ChatGPT-5 pulled ahead with wins in creative storytelling, real-world planning, emotional intelligence and user-first explanations. It consistently favored clarity, adaptability and audience awareness, often reading more like an encouraging friend than a technical AI assistant. Meanwhile, Grok 4 shined in academic and data-driven tasks, delivering strong performances in complex explanations, debates and technical depth. Ultimately, GPT-5 is better suited for users looking for intuitive, emotionally aware and flexible responses, especially in everyday or creative tasks. Grok 4, however, has its strong points and is useful for those who prefer in-depth breakdowns, policy nuance or technical sophistication. Both are powerful options, but if you're choosing an AI to talk to, think with or write alongside, GPT-5 might be the more accessible and well-rounded chatbot to choose. Follow Tom's Guide on Google News to get our up-to-date news, how-tos, and reviews in your feeds. Make sure to click the Follow button. More from Tom's Guide Apple’s big AI home push revealed: Upgraded life-like robot Siri, smart display and more — here’s what we know Google Gemini just closed the gap by adding 'ChatGPT' features — here’s how they work I used Gemini to transform my old photos into video with Google's Veo 3 — and the results surprised me Back to Laptops AMD Ryzen 7 Intel Core i3 Intel Core i5 Intel Core i7 Storage Size Screen Size Refurbished Screen Type Storage Type Showing 10 of 203 deals Apple 13" MacBook Air M4 (2025) $869View Deal Apple 15" MacBook Air M4 (2025) (15-inch 1TB) $1,599View Deal Dell XPS 13 (2016) $569View Deal Lenovo Yoga Slim 7x (Gen 9) (512GB OLED) $858.11View Deal Lenovo IdeaPad Flex 5i ChromeBook Plus (14-inch 2TB) $499.99View Deal Asus ROG Zephyrus G14 (2024) (14-inch 1TB) $1,579.95View Deal Apple 13" MacBook Air M4 (2025) (16GB RAM SSD) $799View Deal Apple 15" MacBook Air M4 (2025) (16GB RAM SSD) $998View Deal Dell XPS 13 (9380) (13.3-inch 256GB) $635.12View Deal Lenovo Yoga Slim 7x (Gen 9) $1,289.99View Deal See more AI Face-Off Amanda Caswell Social Links Navigation Amanda Caswell is an award-winning journalist, bestselling YA author, and one of today’s leading voices in AI and technology. A celebrated contributor to various news outlets, her sharp insights and relatable storytelling have earned her a loyal readership. Amanda’s work has been recognized with prestigious honors, including outstanding contribution to media. Known for her ability to bring clarity to even the most complex topics, Amanda seamlessly blends innovation and creativity, inspiring readers to embrace the power of AI and emerging technologies. As a certified prompt engineer, she continues to push the boundaries of how humans and AI can work together. Beyond her journalism career, Amanda is a bestselling author of science fiction books for young readers, where she channels her passion for storytelling into inspiring the next generation. A long-distance runner and mom of three, Amanda’s writing reflects her authenticity, natural curiosity, and heartfelt connection to everyday life — making her not just a journalist, but a trusted guide in the ever-evolving world of technology. You must confirm your public display name before commenting Please logout and then login again, you will then be prompted to enter your display name. GPT-5 vs GPT-4: I tested both on 7 real-world challenges — one dominated I challenged Gemini Live vs ChatGPT in 5 voice challenges — there was one clear winner I tested ChatGPT vs Gemini 2.5 Pro with these 3 prompts - and it shows what GPT-5 needs to do I just tested the newest versions of ChatGPT vs Gemini vs DeepSeek vs Claude — and the winner completely surprised me GPT-5 users aren't happy with the update — try these alternative chatbots instead I tested Gemini 2.5 Pro vs Claude 4 Sonnet with the same 7 prompts — here’s who came out on top Latest in AI Apple plans home robot invasion with lifelike Siri that 'injects itself into conversations' — and there's a lot more devices Google Gemini just closed the gap by adding 'ChatGPT' features — here’s how they work Sam Altman responds to GPT-5 backlash — here's all the new features just announced ChatGPT-5 remembers everything you've ever said — here's how to toggle what it knows about you Zuckerberg reveals Meta’s AI superintelligence breakthrough — and why you won’t be using it anytime soon This overlooked GPT-5 upgrade is my favorite — and even as a power user, it blew me away Latest in Face Off iPhone 17 Pro Max vs. Galaxy S25 Ultra: which will be the new flagship king? I tested ChatGPT-5 vs Claude with 7 challenging prompts — here's the winner Hisense U8QG vs. TCL QM8K: which Mini-LED TV is right for you? Asics Novablast 5 vs. Nike Pegasus 41: Which running shoe should you get? I tested ChatGPT-5 vs Google Gemini 2.5 with 10 prompts — and there's a clear winner GPT-5 vs GPT-4: I tested both on 7 real-world challenges — one dominated LATEST ARTICLES Afterpay Day 2025 is back — here are 50+ expert-picked early deals I personally approve I've been using Taylor Swift's headphones for years — and you can get them for less than $100 Xfinity introduces World Soccer Ticket giving you access to more than 1500 matches from the biggest leagues across the globe This gaming PC feels like an Xbox 360 with an RTX 5090 inside - here's why Samsung tipped to be making its own Meta Ray-Ban-style smart glasses — here’s when they launch Tom's Guide is part of Future US Inc, an international media group and leading digital publisher. Visit our corporate site. Terms and conditions Contact Future's experts Privacy policy Cookies policy Accessibility Statement Advertise with us Future US, Inc. Full 7th Floor, 130 West 42nd Street, Please login or signup to comment Please wait...