Saturday, April 13, 2024
HomeArtificial Intelligence NewsApple's latest AI technology surpasses GPT-4 in vision capabilities! Check out the...

Apple’s latest AI technology surpasses GPT-4 in vision capabilities! Check out the new Apple AI.

So we finally have some news from Apple regarding their machine learning SL llms in terms of what they’ve finally been developing. Apple has introduced a multimodal AI system that is pretty impressive because it does actually exceed GPT 4’s capabilities in some regards. This might be the scenario that many have been looking at when they say that GPT 4 is no longer the king. Let’s take a look at exactly what Apple has introduced and how good this new multimodal AI system really is. Let’s take a look at how this system works. It’s called feret, so this is essentially the feret model and it’s based on the research by Apple researchers who created it. Essentially, it’s mainly a vision model. First, it uses a tool called clip viit l14 to understand what’s in the picture and then turns it into a form the computer can work with. Secondly, it also looks at the words you give it and converts them into a format it can understand. Then, it identifies areas in the image. If you talk about a specific part of the picture, like a cat in the bottom left-hand corner, the model uses special coordinates to find exactly where that is in the image. Of course, we do have processing and shapes features, and it’s really smart in dealing with different shapes in the picture, not just simple boxes. It looks at many points in the area you’re talking about and understands the details and locations of each point. Finally, it brings together this information to accurately find and describe the specific part of the picture you’re talking about.

Essentially, what we have here is a really impressive advanced image identification model that, when on certain benchmarks compared to GPT 4, does exceed GPT 4’s vision capabilities. So, you can see here, first of all, there are some benchmarks that you may want to look at. On the benchmarks for the feret model, we can see that feret actually has all of the input types, which are point, box, and free form. It also has very good output grounding, which essentially just means that it can understand exactly the relationship between certain objects in the image and what they actually do in the real physical world. Then, of course, we have on data construction and GPT generate and robustness, and of course, the quantitative evaluation of refer SL ground with chat. This is very interesting because, in this section of the paper, they didn’t actually compare it to GPT 4 with vision. They compared it to GPT 4 Roi. But later on in the paper, I will show you that compared to GPT 4 with vision.

If we take a look at GPT 4 Roi, we can see here that it says GPT 4 Roi instruction tuning large language model on the region of interest. Essentially, what GPT 4 Roi was was specifically a fine-tuned version. In the benchmarks of the PDF, I’m guessing that the researchers likely tested against GPT 4 Roi instead of GPT 4 vision. GPT 4 Roi is specifically designed for understanding and interacting with the regions of interest in images, which is a more advanced and specialized task than what GPT 4 vision might be designed for. GPT 4 Roi’s ability to combine language and detailed image analysis, especially focusing on specific areas within images, makes it a more suitable benchmark for testing the feret model’s capabilities in fine-grained multimodal understanding and interaction. This comparison helps to highlight the advancement and specific strengths of the feret model in handling complex vision tasks.

In the paper, they actually did say that, on the other hand, GPT 4 vision is more knowledgeable in common sense. For example, it can further highlight that the exhaust pipe can reduce the noise. GPT 4’s enhanced linguistic capabilities are much more advanced. In regard to grounding, feret does excel at identifying most traffic lights even in cluttered scenes. Nevertheless, feret shines, especially when precise bounding boxes for grounding are needed, catering to those applications that require pinpoint accuracy in smaller regions.

If we compare GPT 4 Vision to Apple’s new multimodal feret model, it’s clear that feret excels in accurately identifying small and specific regions in images, particularly in complex scenarios. GPT 4 can recognize areas outlined in red or specific in text but tends to struggle with smaller regions. Whereas GPT 4 vision is knowledgeable and effective in general knowledge question answering related to the image regions, feret stands out for its precision in pinpointing small areas, filling the crucial gap in detailed image analysis.

Furthermore, Apple has been actively acquiring a range of artificial intelligence companies in recent years with the aim of enhancing the AI and machine learning capabilities of its products and services. These acquisitions have allowed Apple to tap into the expertise and technology of these companies to develop advanced AI and machine learning capabilities for a range of applications. One such feature is the rumored Apple GPT, a language model similar to GPT 3, which aims to enhance Siri’s virtual assistant capabilities and other AI-powered features on Apple’s products. With a heavy focus on machine learning, Apple is committed to staying ahead of the curve in the technology industry, driving innovation and pushing the boundaries of what’s possible with this technology.

In conclusion, Apple’s advancements in machine learning and AI have made significant strides in recent years, with the introduction of the feret model showcasing their commitment to pushing the boundaries of what’s possible in AI. With the rumored Apple GPT on the horizon, it’s clear that Apple is not resting on its laurels but instead continuously striving to be at the forefront of technology and innovation in the AI space. Exciting times lie ahead for Apple and AI enthusiasts as we anticipate the groundbreaking developments that will continue to shape the future of AI technology.

Leah Sirama
Leah Sirama
Leah Sirama, a lifelong enthusiast of Artificial Intelligence, has been exploring technology and the digital realm since childhood. Known for his creative thinking, he's dedicated to improving AI experiences for all, making him a respected figure in the field. His passion, curiosity, and creativity drive advancements in the AI world.


  1. when siri came out it was the best, dough bad, i can imagine apple could make siri again the best as implementing it with the apple apps is easier than if google has to implement it with samsung apps and oppo and so on…also everyone uses their own different android launcher. and theres a gazilian androids and most of them have too weak a processor to use Ai whereas all iphones of last 3 years have some Ai chip

  2. But what we have to state clearly: Yes apple is maybe behind Samsung and Samsung has things first. But! Apple don’t wanna do things first – apple wanna do things right. I had Samsung etc phones for ages and had soooo many problems! Now is switched to Apple and never had a single problem since what apple releases – works!

  3. Apple has no business being in AI as far as public interest goes. Apple as a company has attempted and succeeded in a number of instances in monopolizing aspects of it's industry. A powerful monopolized AI system controlled by a powerful corporate tech company such as Apple would guarantee a stranglehold on nearly everything we do in our day to day and would be even harder to stop or regulate.I find Apple to be the prime example of corporate fascism within the tech industry. AI needs to be transparent to all, owned by none and taught by humans with loving hearts, smart and healthy minds and that value humanitarian pursuits above all. Not human beings filled with greed to meet their own self interest in a grossly unregulated capitalistic society. If anyone finds AI to be frightening, remember that AI is only the reflection of our own frightening selves that taught and built AI. Everyone needs to accept and see their own flaws and shortcomings and find healthy ways of fixing that. Possibly through therapy that works on mental health and emotional wellbeing. Then at that point we may be worthy of training AI. That needs to happen sooner than later for the sake of us all.

  4. I think of all the exciting things AI can do my main concern is Hallucination. Even the smartest person in the room who understands AI to its deepest core realizes this is a very dangerous issue

  5. Why nobody consider bard ? 😂
    It actually understood what the what the shock absorber is (even if I posted a shitty picture taken form the TV …)
    Apparently Bard vision is pretty good

  6. Why not ask GPT the exact same question that the other language models were asked in reference to the bike shock absorber? You said highlighted which can refer to where the picture receives the most light since the box is not a highlight. The original question did not say highlighted. It is enclosed in a yellow box ie [region0] in the same color font as the box in question and lets you compare the results on par honestly by using the exact same question. So copy and paste using same font color for [region0]…What is the purpose of the object [region0] on the bike? The same applies to the [region0] and [region1] in red font enclosed in red ovals. Using the same font color and the same text within the same question used on the actual image might have improved results like it did with ferret.

    Using white font over white background makes it invisible which lessens the effect intended for emphasis for the sake of the video. Also, proof reading videos before releasing them so they are not filled with text errors that were not the same as what you said will only strengthen your reputation. Choosing that emphasis method to emphasize your entire speech instead of just the part needing actual emphasis filled with text errors was a poor way to relay the information. You could have just talked and not used the font and it would have been clean, better and less distracting. "Ladies and gentlemen, Apple has finally to make their entrance into the generative AR space." Everything that followed was a mess.

  7. Why do they have a woman 👠♀️ as chat gpt and its not even real robot 🤖 just an image they used. I get that its not year 3,000 but i think the images should be made into real robots not just an image.

  8. No doubt Apple is having problems something people have been talking about since October, "Apple shares fell more than 3% after Barclays downgraded the stock and trimmed its price target, saying weakening iPhone 15 sales were likely a warning sign for iPhone 16 sales and broader hardware projections." – CNBC

  9. I think of all the exciting things AI can do my main concern is Hallucination. Even the smartest person in the room who understands AI to its deepest core realizes this is a very dangerous issue. There must be a real way to mitigate this. Also, how will the various AI’s models compete against each other. In the end Evil versus Good. I think the most important part of AI is finding stratigic medical applications, treatments and ultimately cures. Aside from this my other worry is how AI would affect military operations. I’m sure when this is applied to battlefield combat/defense it would be of great concern. Lots to ponder over.

  10. ★: I believe we are meant to be like Jesus in our hearts and not in our flesh. But be careful of AI, for it knows only things of the flesh such as our fleshly desires and cannot comprehend things of the spirit such as true love and eternal joy that comes from obeying God's Word. Man is a spirit and has a soul but lives in a body which is flesh. When you go to bed it is the flesh that sleeps, but your spirit never sleeps and that is why you have dreams, unless you have died in peace physically. More so, true love that endures and last is a thing of the heart. When I say 'heart', I mean 'spirit'. But fake love, pretentious love, love with expectations, love for classic reasons, love for material reasons and love for selfish reasons those are things of the flesh. In the beginning God said let us make man in our own image, according to our likeness. Take note, God is Spirit and God is Love. As Love He is the source of it. We also know that God is Omnipotent, for He creates out of nothing and He has no beginning and has no end. That means, our love is but a shadow of God's Love. True love looks around to see who is in need of your help, your smile, your possessions, your money, your strength, your quality time. Love forgives and forgets. Love wants for others what it wants for itself. However, true love works in conjunction with other spiritual forces such as patience and faith – in the finished work of our Lord and Savior, Jesus Christ, rather than in what man has done such as science, technology and organizations which won't last forever. To avoid sin and error which leads to the death of your body and your spirit-soul in hell fire (second death), you must make God's Word the standard for your life, not AI. If not, God will let you face AI on your own (with your own strength) and it will cast the truth down to the ground, it will be the cause of so much destruction like never seen before, it will deceive many and take many captive in order to enslave them into worshipping it and abiding in lawlessness. We can only destroy ourselves but with God all things are possible. God knows us better because He is our Creater and He knows our beginning and our end. The prove texts can be found in the book of John 5:31-44, 2 Thessalonians 2:1-12, Daniel 2, Daniel 7-9, Revelation 13-15, Matthew 24-25 and Luke 21.

    You must read your Bible slowly, attentively and repeatedly, having this in mind that Christianity is not a religion but a Love relationship. It is measured by the love you have for God and the love you have for your neighbor. Matthew 5:13 says, "You are the salt of the earth; but if the salt loses its flavor, how shall it be seasoned? It is then good for nothing but to be thrown out and trampled underfoot by men." Our spirits can only be purified while in the body (while on earth) but after death anything unpurified (unclean) cannot enter Heaven Gates. Blessed are the pure in heart, for they shall see God (Matthew 5:8). No one in his right mind can risk or even bare to put anything rotten into his body nor put the rotten thing closer to the those which are not rotten. Sin makes the heart unclean but you can ask God to forgive you, to save your soul, to cleanse you of your sin, to purify your heart by the blood of His Son, our Lord and Savior, Jesus Christ which He shed here on earth because Isaiah 53:5 says, "But He was wounded for our transgressions, He was bruised for our iniquities; the chastisement for our peace was upon Him, and by His stripes we are healed". Meditation in the Word of God is a visit to God because God is in His Word. We know God through His Word because the Word He speaks represent His heart's desires. Meditation is a thing of the heart, not a thing of the mind. Thinking is lower level while meditation is upper level. You think of your problems, your troubles but inorder to meditate, you must let go of your own will, your own desires, your own ways and let the Word you read prevail over thinking process by thinking of it more and more, until the Word gets into your blood and gains supremacy over you. That is when meditation comes – naturally without forcing yourself, turning the Word over and over in your heart. You can be having a conversation with someone while meditating in your heart – saying 'Thank you, Jesus…' over and over in your heart. But it is hard to meditate when you haven't let go of offence and past hurts. Your pain of the past, leave it for God, don't worry yourself, Jesus is alive, you can face tomorrow, He understands what you are passing through today. Begin to meditate on this prayer day and night (in all that you do), "Lord take more of me and give me more of you. Give me more of your holiness, faithfulness, obedience, self-control, purity, humility, love, goodness, kindness, joy, patience, forgiveness, wisdom, understanding, calmness, perseverance… Make me a channel of shinning light where there is darkness, a channel of pardon where there is injury, a channel of love where there is hatred, a channel of humility where there is pride…" The Word of God becomes a part of us by meditation, not by saying words but spirit prayer (prayer from the heart). When the Word becomes a part of you, it will by its very nature influence your conduct and behavior. Your bad habits, you will no longer have the urge to do them. You will think differently, dream differently, act differently and talk differently – if something does not qualify for meditation, it does not qualify for conversation.

    Heaven is God's throne and the dwelling place for God's angels and the saints. Hell was meant for the devil (satan) and the fallen angels. Those who torture the souls in hell are demons (unclean spirits). Man's spirit is a free moral agent. You can either yield yourself to God or to the devil because God has given us discretion. If one thinks he possesses only his own spirit, he is lying to himself and he is already in the dark. God is light while the devil is darkness. Light (Holy Spirit) and darkness (evil spirit) cannot stay together in a man's body. God is Love (Love is light) and where there is no love is hell, just as where there is no light is darkness. The one you yield yourself to, you will get his reward. The reward of righteousness to man's spirit is life (abundant life) and the reward of sin to man's spirit is death. Sin and satan are one and the same. Whatever sin can cause, satan also can cause. Sin is what gives the devil dominion or power over man's spirit. When God's Word becomes a part of you, sin power over you is broken, you become the righteousness of God through Christ Jesus. Where Jesus is, you are and when He went (to the Father), you went. In the book of John 8:42-47, Jesus said to them, “If God were your Father, you would love Me, for I proceeded forth and came from God; nor have I come of Myself, but He sent Me. Why do you not understand My speech? Because you are not able to listen to My word. You are of your father the devil, and the desires of your father you want to do. He was a murderer from the beginning, and does not stand in the truth, because there is no truth in him. When he speaks a lie, he speaks from his own resources, for he is a liar and the father of it. Which of you convicts Me of sin? And if I tell the truth, why do you not believe Me? He who is of God hears God’s words; therefore you do not hear, because you are not of God.” My prayer is, "May God bless His Word in the midst of your heart." Glory and honour be to God our Father, our Lord and Savior Jesus Christ and our Helper the Holy Spirit. Watch and pray!… Thank you for your time and may God bless you as you share this message with others.

  11. This is a horrible hodgepodge of stitched together previous videos. This channel has always come across as AI generated. However, this video, I would call AI degraded! BTW, I’m a fan of AI, but only when it enhances things. If I were in charge of this channel, I would take this video down immediately. It’s not flattering to the company.

  12. Can you please just proofread the subtitles in your videos? They're sloppy and full of typos. Your videos are helpful but I'm going to unsubscribe because they bug me and make me doubt your info.

  13. To be fair, GPT4 can at least 1-shot that motorcycle suspension question 🙂 Shot 0 it identifies the muffler. Tell it the muffler is below the box and ask it to try again to identify what's inside the box, and it was spot on at that point.

  14. AI/Synthetic/Biologic/Humanoid/Robotoid Clones/?/ & DoD/Sentient World Simulation/?/ & Smart Dust/Motes/Micro-Electromechanical Sensors/Tagging/Tracking system/?/ – duck duck go!

  15. 4:50, it would be nice if we can send the same image/prompt to multiple different Vision GPT models At The Same Time to get different outputs At The Same Time rather than just having 1.

  16. Even my very basic problem solver gpt gave me the answer easily, The highlighted region on the motorcycle is the shock absorber or suspension system. Its primary purpose is to absorb and dampen shock impulses from the road, which helps to ensure that the motorcycle's wheels stay in contact with the road surface for better traction, control, and comfort, providing a smoother ride. The suspension system also protects the motorcycle and the rider from the potential damage and discomfort caused by rough terrain.

    Confidence Score: 100%

  17. Apple, like X and Meta, actually has a data, hardware, and software moat. Given the recent NYT lawsuit, it seems like having access to large, company owned, multi-media data may be what wins the race.

  18. Jesus is God & He loves you

    Jesus will soon be seen by all men, women, and children in the clouds. Jesus is returning now! Believe and be saved.

    Exodus 3:14 (God speaking)

    And God said unto Moses, I AM THAT I AM: and he said, Thus shalt thou say unto the children of Israel, I AM hath sent me unto you.

    John 8:58 (Jesus speaking)

    Jesus said unto them, Verily, verily, I say unto you, Before Abraham was, I am.

    John 10:30 (Jesus speaking)

    I and my Father are one.

    Isaiah 9:6

    For unto us a child is born, unto us a son is given: and the government shall be upon his shoulder: and his name shall be called Wonderful, Counsellor, The mighty God, The everlasting Father, The Prince of Peace.

    Matthew 1:23

    Behold, a virgin shall be with child, and shall bring forth a son, and they shall call his name Emmanuel, which being interpreted is, God with us.

    John 1:1 & 14

    1 In the beginning was the Word, and the Word was with God, and the Word was God.

    14 And the Word was made flesh (Lord Jesus), and dwelt among us, (and we beheld his glory, the glory as of the only begotten of the Father,) full of grace and truth.

    John 8:24 (Jesus speaking)

    I said therefore unto you, that ye shall die in your sins: for if ye believe not that I am he, ye shall die in your sins.

    John 14:9 (Jesus speaking)

    Jesus saith unto him, Have I been so long time with you, and yet hast thou not known me, Philip? he that hath seen me hath seen the Father; and how sayest thou then, Show us the Father?

    Hebrews 1:1-3, & 8 (God calls His Son "O God" because Jesus IS God in the flesh)

    1 God, who at sundry times and in divers manners spake in time past unto the fathers by the prophets,

    2 Hath in these last days spoken unto us by his Son, whom he hath appointed heir of all things, by whom also he made the worlds;

    3 Who being the brightness of his glory, and the express image of his person, and upholding all things by the word of his power, when he had by himself purged our sins, sat down on the right hand of the Majesty on high;

    8 But unto the Son he saith, Thy throne, O God, is for ever and ever: a sceptre of righteousness is the sceptre of thy kingdom.

    1 John 5:7

    For there are three that bear record in heaven, the Father, the Word, and the Holy Ghost: and these three are one.

    Titus 2:13

    Looking for that blessed hope, and the glorious appearing of the great God and our Saviour Jesus Christ;

    Revelation 1:7

    Behold, he cometh with clouds; and every eye shall see him, and they also which pierced him: and all kindreds of the earth shall wail because of him. Even so, Amen.

    Isaiah 44:6 (God speaking)

    Thus saith the LORD the King of Israel, and his redeemer the LORD of hosts; I am the first, and I am the last; and beside me there is no God.

    Revelation 1:8 (Jesus speaking)

    I am Alpha and Omega, the beginning and the ending, saith the Lord, which is, and which was, and which is to come, the Almighty.

    Revelation 22:13 (Jesus speaking)

    I am Alpha and Omega, the beginning and the end, the first and the last.

    There Are None Righteous / How To Be Saved

    Romans 3:10 & 23

    10 As it is written, There is none righteous, no, not one:

    23 For all have sinned, and come short of the glory of God;

    Luke 5:31-32 (Jesus speaking)

    31 And Jesus answering said unto them, They that are whole need not a physician; but they that are sick.

    32 I came not to call the righteous, but sinners to repentance.

    1 Peter 3:18 (The word “quicken” means “to make alive”)

    For Christ also hath once suffered for sins, the just for the unjust, that he might bring us to God, being put to death in the flesh, but quickened by the Spirit:

    Romans 10:9

    That if thou shalt confess with thy mouth the Lord Jesus, and shalt believe in thine heart that God hath raised him from the dead, thou shalt be saved.

    Acts 4:12

    Neither is there salvation in any other: for there is none other name under heaven given among men, whereby we must be saved.

    Ephesians 2:8-9

    8 For by grace are ye saved through faith; and that not of yourselves: it is the gift of God:

    9 Not of works, lest any man should boast.

    Repent of your sins or suffer the consequences. Lord Jesus died in our places personally to take the death punishment that sin deserves and then resurrected by the power of God. Believe this and sincerely repent of your sins each time you sin and you will have eternal life and nothing to fear. Fail to repent and you will end up in the Lake of Fire.


Please enter your comment!
Please enter your name here

- Advertisment -

Most Popular