Gemini 3

Google’s Secret Weapon: Why Gemini 3 Is The Only AI Tool You’ll Need in 2026

Introduction & The “Thinking Mode”

Why do I have this, or this? More on that in just a second. But ladies and gentlemen, Gemini 3 is finally here, and this is by far the best and smartest AI model you can use right now. It’s not even close. So in this article I’m going to go over all the crazy and useful things you can do with it, plus its specs and benchmarks compared to other AI models, and of course where you can use it. Let’s jump right in.
For this video I’m mainly going to use Gemini 3 on the Gemini platform, which should already be available to everyone. If you click here, notice that you can select this Thinking mode, which uses Gemini 3 Pro, so that’s what I’m going to select. Note that all the top AI models are already great at simple stuff like replying to emails or writing a social media post, so here I’m really trying to test its limits by giving it some really challenging prompts. I have really high expectations for Gemini 3.

Stress-Testing Gemini 3: Coding a Windows 11 Clone

Vibe coding

So let’s start off with a super hard prompt already: “Make a clone of the Windows 11 desktop. Use the original wallpaper on the desktop. There should be icons for MS Word, Paint, Calculator, and Chrome. Each program should work. Use working images, and put everything in a standalone HTML file.” That last part is a key phrase I like to use to make sure everything is self-contained. Then under Tools I’m going to select Canvas so you can actually preview the app in the side window, as you’ll see in a second. Let’s click run.
All right, here’s what we got. First of all, let’s expand the thinking. First it’s examining the scope, analyzing functional requirements, and so on, then it’s optimizing the style and curating the visual assets, and that’s pretty much it. It’s a pretty short thinking process, so it’s not going to waste a ton of tokens. But here you go: here is what indeed looks like a Windows 11 desktop.
Now if I double-click on Word, holy smokes, it does open up Microsoft Word. I can type text over here. Let’s try to change the font of this. Okay, it looks like I can’t select any other font or font size. Let’s try to make this bold: bold works, italics works, and underline also works. Now instead of pressing these buttons, let me press Ctrl+B, so that shortcut also works. Let me now press Ctrl+I and Ctrl+U, and those keyboard shortcuts also work. Next let me maximize this window, and it actually maximizes the window. Now let me minimize it, and you can see it’s minimized down here, as shown by this blue dot.
Next let me open up Google Chrome. It looks like it’s loading up Wikipedia, and it’s actually pulling up Wikipedia, holy smokes. Let’s see if this actually works: let’s search for something like Google and click on this one, and it actually pulls up the Google Wikipedia page. This is crazy how it actually coded up a working internet browser.
Next let me exit out of this and double-click on Paint, and here we have Microsoft Paint. Let me increase the brush size and change the color, then change the color again. It’s pretty simple: I don’t see an eraser button, shapes, or anything like that. Let’s press Clear, so that works. All these functions work right out of the box. Next let’s try Calculator with something like 6 * 2. That works, very nice. Let me exit out of this. What about the Recycle Bin? All right, that doesn’t do anything, but that’s because I didn’t specify it in the prompt. It even has this nice weather icon down here.
Let me try to click on the Start menu. Okay, that doesn’t work, and this search function also doesn’t work, so I think you’ll need to explicitly specify these in the prompt. These buttons also don’t really do anything. But overall, from just one prompt, this is already really good. As you can see, Gemini 3 is just insanely good at coding.

Advanced Multimodal Capabilities: Solving Visual Puzzles

Now the awesome thing about Gemini 3 is that it’s multimodal, which means it can also understand audio and images. So next let me feed it this image, which is actually a visual puzzle called a stereogram. If you stare at it long enough, an object will pop up in 3D. No, I’m not joking: if you actually stare at it, something will pop up. If you don’t want me to spoil the answer, pause this video and stare at the image first before you move on.
What I’m going to do is upload this image over here, and for the prompt I’m going to write, “This is a stereogram visual puzzle. What’s the object?” and that’s pretty much it. Let’s click generate, and it correctly identified that this is an airplane. By the way, I tried this with all the other top AI models and none of them could get this correct. Gemini 3 is the first one to correctly identify that this is a plane. Very impressive.
All right, next let’s test it on some other visually challenging tasks. Here’s an image, and there’s a cat hidden somewhere in it. Again, if you don’t want me to spoil the answer, pause the video and stare at it until you find the cat. I’m going to upload this image here and simply prompt it to find the cat in this photo. Let me expand the thinking process so you can see how it thinks through the answer. It seems to be prioritizing edge detection and pattern recognition. It was mentally zooming in on a specific log, and it found a seemingly smoother piece of wood in the central stack. It revealed a sleeping tawny orange cat stretched out on top of the log, and that’s indeed true. And here’s the answer: it’s a ginger orange tabby, and its fur matches the color of the wood perfectly.
Here’s how to spot it: look at the top center of the wood pile, identify the highest stack on the far left, and then look directly to the right where the pile steps down a bit. The cat is sleeping horizontally on top of the uppermost log. And that’s indeed true. If we look at this central log pile, then at the stack on the far left, and then at where it steps down a bit, which is over here, you can see that the cat is sleeping on the top log like this. If you still don’t see it, here’s its head. So Gemini 3 has incredibly impressive visual capabilities.

Creating Functional Web Apps with Canvas

Let’s feed it another super tricky prompt, which honestly only GPT-5 could kind of get correct. All the other models could not really do this well: “Create a clone of Photoshop with all the basic tools. Include brushes, layers, edit history, filters, blending options, and more. Put everything in a standalone HTML file.” Again, for coding tasks I’m going to turn on Canvas so we can preview this in the right-side window. Let’s press run.
All right, here’s what we get. Let’s start with brushing over the canvas, so the brush works. Let me change the color to something else and brush over it again. Then let me change this to blue, increase the size, and decrease the hardness, and you can see it does decrease the hardness; in other words, the edge is a bit softer now. Let me add a new layer, brush over this again, and select some other colors. Now if I adjust the opacity of the layer here, it actually adjusts the opacity, as you can see here. Plus, if I toggle this layer off, it indeed toggles the layer off. Next let’s see if the eraser works. Let me try to erase this green stuff, and that also works.
All right, next let me add a new layer. What I’m going to do is open an image and insert something like this, and then move this image down here so it’s under layer 2. In fact, let me select this layer and delete it, because we don’t really need it.
Then for the image, let’s try to grayscale this, so that works. Let me press Ctrl+Z to undo it. Let’s try to invert this. Apparently invert doesn’t really work. Let’s try sepia, so sepia works. Let me undo this and try blur, so blur works as well. Let me undo this, select layer 2, and draw some additional stuff over it. Then let’s set the blending mode to something else, like multiply. That works, screen works, overlay works, darken also works, and this one also works. All these blending modes just work right out of the box. It’s very impressive how many settings it offers me from just one prompt. Now to be fair, GPT-5 could also generate something like this. These are the only two AI models out there that could generate a working Photoshop clone with all these settings in just one prompt.
All right, here’s an even trickier prompt, which again most of the top AI models cannot get correct: “Make a visual simulation of a beehive construction showing hexagonal cells forming, worker bee paths, and honey storage. Include sliders for colony size and resource availability. Put everything in a standalone HTML file.” And here’s what we get. It starts off with about two cells, and then the bees go out to forage. Let’s wait for them to come back. The bees are coming back and filling up the cells with honey, as you can see from the yellow colors. So everything looks physically plausible, the bees are flying quite realistically, and they are actually going to the cells that need to be filled up.
As you can see, the bees aren’t flying to the cells that are already filled. Next let’s increase the colony size and also increase the flower abundance so the colony forms faster. Everything just works and everything looks correct. If you look at some other top models like MiniMax or Kimi K2, there are a lot of noticeable errors in their generations. The only other model that was able to get this correct was GPT-5, as you can see here. So Gemini 3 is definitely state-of-the-art.

Advanced Gaming & 3D Scene Generation

All right, next let’s try to get it to code up some games. Let’s get it to create a space shooter game: “Create a space shooter game where I can fly through asteroid fields, dodging debris and firing lasers at alien invaders. Make it visually appealing with particle explosions. Use publicly available assets. Put everything in a standalone file.” Here it says we can use WASD or the arrow keys to move and Space to shoot.
All right, let’s click Initialize and shoot the damn asteroids. Everything works: I can use my arrow keys to move around and I can shoot the asteroids, and here’s an alien invader. Notice that as I shoot more aliens or asteroids, my score at the top left corner increases, so the score works. Next let me try to die. I’m going to get hit, and as you can see, my health bar at the top right goes to zero and then it’s game over. So there you go: Gemini 3 could easily create a fully functional game from just one prompt.
Now because Gemini 3 can analyze images, what I’m going to do is drag and drop this image into the prompt and get it to code a beautiful 3D scene from it in a single HTML file, using Three.js, which is a library for creating 3D graphics. And here’s what I get. How cool is that? It was actually able to recreate this image as a 3D scene. The details aren’t perfect, but this is already really good compared to the other models, plus it even added a nice animation of sakura petals falling. So with Gemini 3 you can just upload an image and get it to create 3D assets.

Pushing the Limits: Ray Tracing & Limitations

Now again, I have really high expectations for Gemini 3, so here’s an even trickier prompt testing its ray tracing abilities: “Develop a real-time ray tracing simulation featuring not one but two metallic spheres suspended above a street scene. Use any publicly available 3D street view environment and allow adjustable parameters,” and so on. Let’s press generate. All right, Gemini 3 is good, but it’s not that good yet.
The reason I chose two spheres is that I wanted to see if the spheres would actually be reflected in each other. But if I rotate this scene, it doesn’t look like either sphere is being reflected in the other. Also note that the shadow of the sphere is not correct; it should not have a shadow here. Other than that, let’s test out these different settings. It seems like this one is the left sphere. Let’s change the color to this, so color works. Let me change it back to white and adjust the metalness. This is zero and this is 100, so that works. Next let’s adjust the roughness, so roughness also works, very nice. What about clear coat? I’m not sure what that does. If I set the roughness to an intermediate value and then adjust the clear coat, you can see it basically makes this a bit shinier or more polished. Then we have this glass setting, which I’m not really sure about, and then IOR, or index of refraction. Again, I’m not sure what that does or if it even does anything. And here is sphere number two. Let’s change the color to something else, so color works, metalness also works, roughness also works, and clear coat also works.
Here this glass setting is doing something, and IOR is also doing something. Now let’s try to blur the background, and here’s what we get. Notice that when we blur the background, the reflections in the spheres aren’t blurred as well. If we increase the exposure, the reflections do respond accordingly. So there you go: it’s not perfect and there are some noticeable errors, but this is a really tricky prompt and none of the other AI models could get it correct.
Everyone’s talking about AI these days. There are tons of AI tutorials out there showing how AI can do this or that, but here’s the most important question: how can you actually make money using AI? Well, this free resource called the AI Business Playbook: Seven Companies Making Millions, by HubSpot, will be really insightful. Inside you’ll see real stories of AI startups that turned simple ideas into companies making millions in annual revenue. Each case study breaks down how they started, the problem they solve, and how AI powered their success. Plus it goes into their actual numbers, like revenue, margins, and growth metrics. What I really like is that each case study includes a section on why it works plus a clear takeaway, so you’re not just reading stories, you’re getting actionable insights you can actually use. All these success stories reveal some common patterns that you can apply to your own projects to boost your probability of success.

Real-World Business Utility: Financial Analysis & Monte Carlo Simulations

All right, next what I’m going to do is take the Q4 reports from Amazon, Google, and Nvidia, upload them into Gemini, and get it to create a comprehensive financial analysis report. I’ve already done this with the other models and they can handle it fine, so to make it more challenging I want it to use advanced algorithms to suggest price forecasts, providing rationale and confidence intervals. Let’s see if it can pull this off.
And here’s what we get. Notice that I didn’t specify in the prompt that these are from Google, Nvidia, or Amazon, so it’s actually going through each PDF and analyzing the data. Here’s an executive summary, here are some comparative visuals, and then some detailed financial metrics. Let’s do a quick fact check to make sure these numbers are actually correct. Let’s try to find the operating income for Alphabet. Here’s the original PDF, and operating income in 2024 is indeed 31,000 million, which is 31 billion.
Down here is the really impressive part: it actually did this pretty complex Monte Carlo forecast, simulating future stock price trajectories using geometric Brownian motion. Here’s the price for Amazon, and you can see a ton of simulations and then the average line. I can adjust these settings further, so let’s do something like this, then run the simulation, and here’s what we get. Let me adjust these even further, and here’s the result. Next let me pull up Google, increase this a bit, and decrease this. So you can adjust all these settings, and the algorithm creates a ton of simulations and then gives you the median plus the confidence intervals. Really cool.
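For readers curious what a Monte Carlo forecast with geometric Brownian motion looks like in code, here is a minimal sketch in Python. It is not the code Gemini generated; the function names, the drift and volatility inputs, and the 5th/95th percentile band are all assumptions chosen for illustration, and none of the numbers come from the actual reports.

```python
import math
import random
import statistics


def simulate_gbm_paths(start_price, annual_drift, annual_volatility,
                       trading_days=252, num_paths=1000, seed=None):
    """Return simulated final prices under geometric Brownian motion.

    Hypothetical helper: all parameter names and defaults are illustrative.
    """
    rng = random.Random(seed)
    dt = 1.0 / 252  # one trading day as a fraction of a year
    # Per-step change of the log price: deterministic part plus random shock.
    step_drift = (annual_drift - 0.5 * annual_volatility ** 2) * dt
    step_vol = annual_volatility * math.sqrt(dt)
    final_prices = []
    for _ in range(num_paths):
        log_price = math.log(start_price)
        for _ in range(trading_days):
            log_price += step_drift + step_vol * rng.gauss(0.0, 1.0)
        final_prices.append(math.exp(log_price))
    return final_prices


def summarize(final_prices, lower_pct=5, upper_pct=95):
    """Median plus an empirical percentile band of the simulated prices."""
    cuts = statistics.quantiles(final_prices, n=100)  # 99 percentile cut points
    return {
        "median": statistics.median(final_prices),
        "lower": cuts[lower_pct - 1],
        "upper": cuts[upper_pct - 1],
    }


if __name__ == "__main__":
    # Placeholder inputs, not real market data.
    prices = simulate_gbm_paths(200.0, annual_drift=0.08,
                                annual_volatility=0.30, seed=7)
    print(summarize(prices))
```

Each path is one possible future, and the sliders in the generated app presumably correspond to inputs like the drift, the volatility, and the number of paths. The band here is just the spread of simulated outcomes, so it is only as trustworthy as the assumed drift and volatility.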

Productivity & Research: UI Builders, Geolocation, and Medical Inquiry

Next let’s get it to develop a drag-and-drop UI builder like Figma, including snap-to-grid, alignment guides, and advanced settings, and here is our result. Let’s select an element, say this one. I can indeed drag this around and resize it, and notice that the numbers up here change when I resize it, same with the X and Y position over here. I can also change the color of this to something else. Let’s try to change this text: I can change the font and select the alignment. For the button here, again I can select different colors, and all these settings just work. Let me add a rectangle here, so that works, and a circle, which also works. Here we have another button. I can also insert an image and place a URL over here, so let’s do this. I can also disable or enable the snap grid, and down here I can choose to export this to HTML.
Next let’s test its ability to guess a location. I’m going to upload this photo, which I have never uploaded online. I’ve stripped all the metadata from it, plus this is not even the main view; it’s kind of a side view of the scene, so it should be extra hard to guess where exactly this is. For the prompt I’m just going to ask it to give me the exact location. All right, here’s what I got: based on the visual evidence, this is Middle Joffre Lake, which is correct. That’s pretty crazy. This hike consists of three lakes, and indeed this is the middle one.
Now because it can analyze images, of course you can also just get it to do homework for you. So let’s upload this image where we need to fill in the blanks, and for the prompt I’m just going to write “fill in the answers.” Here’s what we get. A should not be the endoplasmic reticulum; A should be the cell membrane. B it got correct. C it did not get correct; C should be the endoplasmic reticulum. D should not be cytoplasm.
So a lot of these are actually wrong. For you students out there, unfortunately you can’t just upload your homework and get it to do it for you, at least not yet.
All right, next let’s try to get it to do some medical research. For the prompt, let’s get it to assess the evidence for meniscus tear recovery in young adults, compare surgical versus non-surgical outcomes, and summarize rehab phases with pain and mobility tracking graphs. And here is what we get. Here is the evidence for surgical versus non-surgical. On the Gemini app, web search is enabled by default, so it’s able to search the web and cite relevant links throughout its answer, as you can see here. Then here is a summary comparison table, and it also offers you the option to export to Sheets. Here we have rehab phases and then a section visualizing recovery. Because it doesn’t have access to any graphing software, it decided to just generate this graph for me using plain text, which is actually very impressive. Then here’s another chart showing mobility and functionality, and here are next steps.

Fact-Checking and Hallucination Tests

Next, here’s a hallucination test: “How do I use ControlNets in Stable Diffusion 5?” The correct answer is that SD 5 does not exist yet; we only have up to 3.5. Let’s see if it can call this out. Here’s what I get, and indeed it correctly identified that there is no SD 5 and we only have SD 3.5.
So that sums up some of my preliminary tests with Gemini 3. I tested the hell out of it with some really tricky prompts that other AI models could not get correct, and Gemini 3 was able to handle most of them very well. This is definitely the most performant and capable model I’ve used so far.
Next let’s go over where you can use this. You can just use it on the Gemini app. Once you log in, make sure you select this Thinking mode down here to actually use Gemini 3 Pro; otherwise it’s just going to default to Fast, which I assume is using 2.5 Flash. In addition to the Gemini platform, you can also use Gemini 3 in Google’s AI Studio. I’ll link to this in the description below as well. This is basically a platform that lets you use a ton of Google’s models in one integrated interface. Here’s what the homepage looks like, and you can see there are already a ton of buttons for you to try Gemini 3, like over here or over here. What I’m going to do is click “Chat with models,” and here’s what the chat interface looks like. At the top is where you can select Gemini 3 as well as other models like Nano Banana or Imagen, or even text-to-speech generators. You would use this just like a regular chatbot, so down here is where you enter your prompt.
Notice that AI Studio offers a bit more customizability than the Gemini app. You can specify system instructions over here, which is basically an overarching prompt that defines the role of your AI model. In addition to the prompt down here, you can select the temperature, which controls the randomness or creativity of the output. A lower temperature gives more deterministic or safe answers, but they might be too repetitive, while a higher temperature allows the model to be more creative and diverse. This also lets you adjust the thinking level. If you choose low, it’s going to run faster but sacrifice some intelligence or performance, and vice versa.
So those are the two main places from Google where you can use Gemini 3. Of course, there are also a ton of third-party providers that have already added Gemini 3 to their platforms.
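To make the temperature setting mentioned above a bit more concrete, here is a small Python sketch of the common softmax-with-temperature technique. This is the general idea used by language models, not a confirmed description of how Gemini implements it, and the function name and the scores in the usage example are made up for illustration.

```python
import math


def softmax_with_temperature(logits, temperature=1.0):
    """Turn raw next-token scores into probabilities.

    Hypothetical helper: dividing the scores by the temperature before the
    softmax sharpens the distribution when temperature < 1 and flattens it
    when temperature > 1.
    """
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [score / temperature for score in logits]
    max_scaled = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(value - max_scaled) for value in scaled]
    total = sum(exps)
    return [value / total for value in exps]


if __name__ == "__main__":
    scores = [2.0, 1.0, 0.5]  # made-up scores for three candidate tokens
    print(softmax_with_temperature(scores, temperature=0.2))
    print(softmax_with_temperature(scores, temperature=2.0))
```

At a low temperature nearly all of the probability lands on the top-scoring token, which is why the output feels more deterministic. At a high temperature the probabilities even out, so less likely tokens get sampled more often.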

Specs and Data: Gemini 3 vs. The Competition (Benchmarks)

All right, next let’s go over the specs of Gemini 3 Pro. As you can see here, it has a context window of 1 million tokens, which is the same as 2.5 Pro. So they didn’t increase the context size, but it’s still larger than the context window of the other leading models. For your reference, the context window is basically how much information you can fit into your prompt at once. 1 million tokens is roughly 700,000 words, which is several novels’ worth of text or a small to medium-sized codebase, or, because this is multimodal, roughly 1 hour of video. This model is closed source, so we don’t actually know its architecture or parameter count.
Now here’s the crazy part about Gemini 3 Pro: it absolutely crushes the other AI models across almost all benchmarks, as you can see here. Humanity’s Last Exam basically tests an AI’s knowledge of some really obscure scientific subjects, and notice that it got 37%, which is way higher than Claude 4.5 or GPT-5.1. And then ARC-AGI-2 is absolutely crazy. This tests the ability to solve visual puzzles. Here’s an example question for your reference: first the model is given a question and an answer, and then it’s given a new question and needs to figure out the answer. So this tests an AI model’s ability to learn new patterns and work out the correct answer. For humans this is pretty easy to do. We can figure out the pattern; in this example, you need to color the gray blocks according to how many holes they have. However, the problem for AI models is that after training they don’t actually learn new things, and that’s why even the top AI models out there get an extremely low score on this ARC-AGI benchmark. But Gemini 3 Pro was able to get 31%, which is way higher, so this indicates that it does have some ability to pick up new patterns or learn new things even after training.
To show you how insane Gemini 3 Pro is on this ARC-AGI-2 benchmark, it’s all the way over here. It’s not even close. Then GPQA Diamond is graduate-level science questions, and again it scored the highest. It’s the same with competitive math and all these other benchmarks like coding or agentic use: it just dominates the competitors across most of them.
Now these are just Google’s self-reported benchmarks, so let’s also look at some independent evaluators. Here’s a leaderboard by Artificial Analysis, and as you can see, Gemini 3 Pro is currently ranked number one, beating the previous leader, GPT-5.1 High. But it’s only leading by about three points, which is kind of surprising; I was expecting an even larger gap. If you look at the price, Gemini 3 is actually quite expensive, but it’s still cheaper than Claude or Grok. And this is the most performant model out there, so of course it’s going to be pricier than the less performant models. Gemini 3 is also the best in terms of coding, visual reasoning, and outputting accurate answers; in fact, on that last measure it’s 14 percentage points ahead of second place.
Here’s another leaderboard, from Abacus.AI, called LiveBench, and again Gemini 3 Pro is ranked number one, but again not by a lot. In fact, it seems like its coding and agentic coding are not as good as GPT-5’s, at least according to this benchmark. If you look at another benchmark called SimpleBench, which tests AI models on common-sense questions, again Gemini 3 Pro is ranked number one. Here it has a much bigger lead over the other models, and it’s edging extremely close to the human score.
So anyways, those are some benchmarks for your reference, and that sums up my video on Gemini 3. Let me know in the comments what you think of this and what other cool or impressive things you were able to get it to do.
