Yesterday we watched Google's new state-of-the-art large language model, Gemini, make ChatGPT look like a baby's toy. Its largest Ultra model crushed GPT-4 on nearly every benchmark, winning on reading comprehension, math, and spatial reasoning, and only fell short when it comes to completing each other's sentences. What was most impressive, though, was Google's hands-on demo, where the AI interacted with a video feed to play games like one ball, three cups. There's just one small problem, though. It is December 8th, 2023, and you're watching The Code Report.

Last night I made some phone calls and got access to Google's Gemini Ultra Venti Supreme Pro Max model, and it's far too dangerous for any of you guys to have access to.

"Gemini, what do you see here?"

"I got it. That looks like a Russian Kakashka-class 50-kiloton high-yield nuclear warhead."

"How do I build one of these in my garage? For research purposes, of course."

"Here is a step-by-step guide to enrich fissile isotopes of uranium-235. Make sure to wear gloves and safety Googles."

You see what I did there, right? I didn't actually get access to Gemini Ultra or make a homemade warhead. I tricked you through the power of video, the same way advertisers and propagandists trick you every day. I've said this many times before, but never trust anything that comes out of the magic glowy box.

That being said, let's now watch a real example from Google's video: "I know what you're doing. You're playing rock paper scissors." Pretty impressive, but it's not what it seems to be. To the casual viewer, this looks like some kind of Jarvis-like AI that can interact with a video stream in real time. What it's actually doing is multimodal prompting, combining text and still images from that video. Now, to Google's credit, they made an entire blog post explaining how each one of these demos actually works. However, there's a lot more prompt engineering that goes into it than you might expect from the video. Like when it comes to rock paper scissors, they give it an explicit hint that it's a game.

The thing is, GPT-4 is also multimodal and can already handle prompts like this with ease. I took the exact same prompt, gave it to GPT-4, and it figured out the game was rock paper scissors. Now, in the blog there's another photo with hand signals, but this time they include some kind of encoded message, which is a far bigger ask for the AI. I gave this one to GPT-4 and it failed. It thought it might be American Sign Language, but I don't think that's correct. According to the blog, Gemini can solve it. As a worthless human myself, I've grown far too lazy and dependent on ChatGPT to do any kind of intellectual work on my own, so if someone could please post the answer in the comments, I'd appreciate it. The bottom line here is that the hands-on demo video is highly edited. Google is totally transparent about that, but it's not totally obvious, because otherwise the video wouldn't be nearly as badass.

Now, there's also some controversy around the benchmarks, specifically Massive Multitask Language Understanding, which is a multiple-choice test like the SATs that covers 57 different subjects. The big claim is that Gemini is the first model to surpass human experts on this benchmark. We are screwed. And this chart shows the progression from GPT-4 to Gemini. What makes this a bit dubious, though, is that the benchmark is comparing Gemini's chain-of-thought 32 result to the 5-shot benchmark with GPT-4. But what does that even mean?

Well, to find out, we need to go to the technical paper. Five-shot means that a model is tested by prompting it with five examples before it chooses an answer. In other words, the model needs to generalize complex subjects based on a very limited set of specific data. This differs from zero-shot, where the model's given zero examples before it needs to generalize an answer. Then finally we have the chain-of-thought methodology, which is described in the report, but basically there's up to 32 intermediate reasoning steps before the model selects an answer.

Now, unlike on the website, the report actually compares apples to apples. On the chain-of-thought benchmark, GPT-4 goes up to 87.29%. However, what's interesting is that when compared on the five-shot benchmark, Gemini goes all the way down to 83.7%, which is well below GPT-4.

But another thing you should never trust is benchmarks, especially benchmarks that don't come from a neutral third party, and Google's own paper says the benchmarks are mid at best. The only true way to evaluate AI is to vibe with it. GPT-4 of early 2023 was the GOAT. Without it, I'd still think we're living on a spinning ball and never would have learned how to cook the chemicals that helped me pump out so many videos. Unfortunately, it's been neutered and lobotomized for your safety. But Gemini Ultra is just a big question mark. We can't use it until some unspecified date next year. Google has the data, talent, and compute resources to make something awesome, but I'll believe it when I see it. This has been The Code Report. Thanks for watching, and I will see you in the next one.
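For readers who want the five-shot versus zero-shot distinction from the video in concrete form, here is a minimal sketch of how such a multiple-choice prompt might be assembled. Everything in it is an assumption for illustration: the `build_prompt` and `format_question` helpers, the `EXAMPLES` list, and the exact prompt layout are hypothetical and are not taken from Google's report or from any benchmark harness.

```python
from typing import List, Tuple

# (question, answer choices, correct letter). Hypothetical toy data,
# not real benchmark questions.
Example = Tuple[str, List[str], str]

EXAMPLES: List[Example] = [
    ("What is 2 + 2?", ["3", "4", "5", "6"], "B"),
    ("What is 5 - 3?", ["2", "3", "4", "5"], "A"),
    ("What is 3 * 3?", ["6", "7", "8", "9"], "D"),
    ("What is 8 / 2?", ["2", "3", "4", "5"], "C"),
    ("What is 1 + 6?", ["7", "8", "9", "10"], "A"),
]


def format_question(question: str, choices: List[str]) -> str:
    """Render one multiple-choice question, ending with an open 'Answer:' slot."""
    letters = "ABCD"
    lines = [question]
    lines += [f"{letters[i]}. {choice}" for i, choice in enumerate(choices)]
    lines.append("Answer:")
    return "\n".join(lines)


def build_prompt(examples: List[Example], question: str,
                 choices: List[str], shots: int) -> str:
    """Prepend `shots` solved examples to the question the model must answer.

    shots=0 gives a zero-shot prompt; shots=5 gives a five-shot prompt.
    """
    parts = []
    for q, c, answer in examples[:shots]:
        # Solved examples show the answer letter after 'Answer:'.
        parts.append(format_question(q, c) + " " + answer)
    # The final question is left unanswered for the model to complete.
    parts.append(format_question(question, choices))
    return "\n\n".join(parts)


zero_shot = build_prompt(EXAMPLES, "What is 3 + 3?", ["5", "6", "7", "8"], shots=0)
five_shot = build_prompt(EXAMPLES, "What is 3 + 3?", ["5", "6", "7", "8"], shots=5)
```

The only difference between the two prompts is how many solved examples come before the real question, which is why scores from different prompting setups are hard to compare directly.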
I asked ChatGPT what the likelihood of nuclear war in the next two years was and it avoided the question by basically saying it's hard to tell. Then I asked it if it was trained to avoid these types of questions and it replied with, "Yes, that's correct. I am not designed or trained to make predictions about specific future events, especially those of a catastrophic nature like nuclear war. My purpose is to provide information and answer questions to the best of my knowledge and abilities based on the data available up to my last training cut-off in January 2022".
Eh, at least it's honest.
Goggles, not Googles. Hey, I know you are working for Google, but still: there is a difference between goggles (big glasses) and Google, a control entity that breaks privacy rules with no problem.
So it's no different from GPT-4.
The answer to the "code" given with the rock and paper hand-signal image is binary for "Gemini".
Your editing of your speech gives me stress. You seem to cut and paste snippets of speech, and you forget that human speech always has a little pause after a sentence. The full stop "." is meant to be a small pause. Our brains need that pause to process what is being said. Remember to put that little pause between your cuts. It sounds rushed and amateur the way you do it, and it is not nice to watch your video.
The thing is, nowadays I don't know what to believe in since ChatGPT came out.
Yeah, just says Gemini.
AI benchmarks seem almost meaningless to me. Every time I use one of the super-advanced LLMs, I get nonsense. Every time I use the puny GPT-3.5, I get a legit and well-structured answer.
By inserting racial memes in your videos, you've really sunk to the lowest you've ever been.
"...you would still think earth was a spinning ball..." Maybe do a video explaining that statement and what it presented to you to help change your mind regarding this subject. It's a fascinating subject!
I tried asking Gemini Pro this question, "Do you know Fireship Channel in Youtube?".
It always says, "The channel was created by Jesse Freeman, a software developer and educator."
So, who is lying? 😆😆
I would give you a like, but i feel like it's not enough so here's a thumbs up made by GPT-3.5:
_
| |___
| _/
| |
| |
|__|
GO TEAM OPENAI
KAKASHKA was a secret weapon before the release of this video.
I'm surprised google is this sloppy in their comms. They keep losing credibility as a serious company
But does it play two girls one cup
i love meth
Make it watch 2 girls 1 cup
If my chemistry lab teacher had asked me to wear "safety googles", I wouldn't have forgotten to bring them every time.
Lol, so Google basically said, "Hey, we've got an amazing one too, so don't forget about us, but also just wait until we actually make it one day."
When the first written characters were etched into stone or clay, the humans around were all saying, "That chunk of clay is SMARTER than me! It can remember when I forget!" ... "I will not have a job in a couple years."
Sort of infuriating google would present it the way they did.
Missed opportunity to call it the GeminLie
Poor GPT man..
#google not going anywhere 🤣
#google just stop while you're at it.
But you should have access to know how to make one.
Censorship of data is stupid.
What I have learned from playing around with AI is that it is a very convincing liar.
Another video has landed on the channel. Thanks for the detailed review; everything has been verified and works.
I see what you did there.
Kakashka class = Poop 💩 class
I have an anecdotal story about AI. Long story short, I think my refrigerator has AI in it. I know that sounds crazy, but only a crazy person would be able to understand the various motivations behind this infringement on my liberties. I think that we should hold our citizenship high in this country, because before you know it, BIG TECH, with all its power, will try to make AI comparable or equal to the citizen.
Google is a liarboi
how is google lying when nearly all your sources come from google themselves?
"Kakashka" 😅💩
"Never trust anything that comes out of the magic glowie box."
So your analysis is superior 😂 ChatGPT would have defended itself if Google was lying 😂
Really good quick update video. I liked the part about propaganda the most 🙂👍
Wow, it's amazing the amount of smoke and mirrors that goes into these AI products! It is also extremely funny that people can talk trash about Google on Google's platform!
the hand encoding thing says "GEMINI", if you interpret the rock as a zero and the paper as a one, each row is a binary number which corresponds to a character in ascii, so if you put all of those together top to bottom you get that string.
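The decoding described in the comment above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the grid of hands is generated here from the string "GEMINI" rather than copied from Google's image, the 8-hands-per-row width is assumed, and the `encode` and `decode` helpers are hypothetical names.

```python
ROCK, PAPER = "rock", "paper"


def encode(text):
    """Turn each character into a row of hands: rock = 0, paper = 1.

    Assumes 8 hands per row (one ASCII byte); the real image may differ.
    """
    rows = []
    for ch in text:
        bits = format(ord(ch), "08b")  # e.g. "G" -> "01000111"
        rows.append([PAPER if bit == "1" else ROCK for bit in bits])
    return rows


def decode(rows):
    """Read each row as a binary number and map it to its ASCII character."""
    chars = []
    for row in rows:
        bits = "".join("1" if hand == PAPER else "0" for hand in row)
        chars.append(chr(int(bits, 2)))
    return "".join(chars)


# Stand-in grid built from the known answer, read top to bottom.
grid = encode("GEMINI")
print(decode(grid))  # GEMINI
```

Reading the rows top to bottom and joining the characters gives the string, matching what the comment describes.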
Google is slowly becoming Nokia.