OpenAI just released ChatGPT Deep Research, a new tool that uses the full o3 model to complete compl...
Video Transcript:
Okay, so OpenAI came out with Deep Research, and I think this might be the most significant release over the last year. And oh boy, look, I'm aware that I say this sometimes, maybe a bit too often, but they keep escalating the features. I think for most people this one will be even more useful than Operator, as it's literally an AI agent that goes out there, searches the entire web, and comes back with a report for you. And it's the first full implementation of o3, so this really, really is a significant release. And it's not just that it looks good, it performs extremely well. Me and the team have been playing with this. Matter of fact, we already maxed out our OpenAI Pro account on the very first day of this coming out. And in this video I want to briefly show you what this is about, how to use it, and what use cases you can expect out of OpenAI's Deep Research. Let's begin.

So first things first: this is their brand new feature that they only shipped to the Pro plan as of now. Yes, that is the $200 plan, unfortunately, and even on that plan, which is unlimited in most other functionalities, you only get 100 deep researches a month. The good news is that Sam Altman on X already pointed out that they will be expanding this to the Plus tier, and that will come with something like 10 deep researches a month. That is in the future, though, and he also pointed out that they even want to bring that to the free tier. But what we're looking at right here is the full version of Deep Research, and here's the key component to this function: this thing uses o3 full under the hood. Not o3-mini, not o3-mini-high, o3 full.

And the way you use it is like a basic GPT-4o prompt. All you have to do is enable this Deep Research button, and if you give it any sort of prompt, it will come at you with several follow-up questions. Once you answer those (you can simply separate them by commas or do it in a bullet point list), you can send that message and it will begin its deep research. So in this first
step, it only uses GPT-4o, and then it engages this deep research agent that uses o3 and a variety of tools under the hood. So this is not just a reasoning model, this is really a reasoning model with access to the internet. It has something like a Bing search engine at its disposal, and it also has the ability to write and execute Python code to organize all of that data. A combination of those and a reasoning model will work through your problem. As you can see right here in the activity, it will find various sources. On many of these reports that I created, it finds dozens of sources for you, and then it compiles all of that, with the power of the most potent reasoning model in the world, which is o3 as of right now, into a very long report for you.

Now let me tell you, that doesn't just sound interesting, it also really is. This o3 reasoning combined with these tools is really the first agent that is very accessible to everybody who can afford the $200 price tag. And the way I came to think of this in my first days of usage is that running a prompt through ChatGPT is similar to a Google search, especially when you enable the search feature. You can also think of this as giving an intelligent assistant that is very well read a computer with access to the internet and some instructions, and then they have 5 minutes to complete the task. And that's super useful; that's why all of us have been using ChatGPT.

Deep Research, on the other hand, when you switch over to this, it's also like handing off a task to an assistant, but this assistant isn't just well read, it's also a very thoughtful assistant that is extremely good at planning and critical thinking. And this assistant is not going to spend 5 minutes on your task, he's going to spend three hours on it, maybe even more in some cases. So let's think about that. If I gave an assistant a task like this, like, "Hey, research all the different YouTube cameras for me," sure, they would follow up with these questions and I would answer them. But then,
if you've ever done some product research on the internet, you might know what the process usually looks like. You go out there, you find different blog posts, different YouTube videos, different comparison tables, you find all the opinions you can, you consume all of that content, and then, if you want to go the extra mile, you create some sort of Excel sheet that gives you an overview. But this takes some time. I mean, think about this: if you want to consider 21 sources in your report, 21 different articles, and you actually want to read them and consider all the information in there, just finding those sources will take you 15 minutes, and then reading through them will take maybe another 45 minutes at least, and then you have to make sense of it.

In this case, with Deep Research, it took me the time that I needed to write this simple little prompt. I had to follow up, and then I suppose it cost me one hundredth of the Pro plan, which would be $2 if you want to calculate it like that. Of course there's many other benefits there, but let's just do that. So it cost me like a minute and $2 versus, well, I think it'd be fair to say that this might be a 2-hour research task.

And I picked this one specifically because I'm actually very well educated on all the different Sony models for YouTube production, and I got to say, these are excellent recommendations. Oh, I wanted to say it's missing the a7 IV, but no, it's actually right here. And look, it doesn't just end with the cameras, it also went a step further. Obviously, if you want an interchangeable-lens camera, you will also need a lens, and let me tell you, this is my YouTube recommendation for so many people: the Sigma 16mm with an APS-C size sensor just works like a charm. In my case I use something a bit more zoomed in, something around the 40mm range, which would be this one on an APS-C, or this one is an excellent recommendation too. This one works too. Let me tell you, I have domain expertise on this stuff, and this is an amazing report. I couldn't do a better job
if I spent 2 hours researching this for a friend of mine who asked me about it. And again, all it took me was about a minute of time and $2 to save 2 hours on this task. Look, I'm trying to look at this neutrally, but this really feels like an unlock and an advantage over everyone who does not have access to a tool like this. ChatGPT is already an advantage, but having this Deep Research thing at your disposal... because it doesn't just end with product research. Let's talk about some other use cases that they pointed out here. This combination of the world's best thinking model, internet search, and running Python code is just something that is a real, real time saver, maybe even more so than Operator for most people right now. I'm not sure, I'm thinking about that one right now.

But at this point I want to give you some usage tips, because the neat thing about this is that it uses GPT-4o to engage this, meaning you can engage Deep Research but you can also upload files and images. This is the latest one from the o3-mini release video. So I could do something like attach an image of the different benchmarks that I pulled together and showed you in that video, and then I can tell it to research the internet and look at different user reviews and opinions on o3-mini, DeepSeek R1, Gemini Thinking 2.0, and o1 Pro. And instead of me going ahead and scouring the internet for all of this information, I can just send this, and all of a sudden I'm using an attached file, an image in this case, in combination with text. I give it some follow-up prompts and then it's going to run it. So you have this essential feature that I know so many of you value, which is attaching files and working with them. Like, look at this: it just took 10 minutes to complete this research, it used 26 sources, and here is a comparison of o3-mini, DeepSeek R1, Google Gemini 2.
0 and o1 Pro for business use cases. And then look at this thing, this is no joke: everything is cross-referenced with the different articles where it pulled information from. I'm still scrolling, by the way. It talks about the benchmarks, it talks about user reviews and feedback, and so much more. And the wonderful thing about pulling in this many sources is that it combats the number one problem with AI outputs, the fact that they're super general, not personalized, not opinionated. Pulling in Reddit opinions on o3-mini, custom prompt engineering: here is the antidote to generic AI outputs. So that's big.

And let me show you some parts of the blog post they published with this that I want to highlight. First of all, there's this one that everybody keeps talking about, which is Humanity's Last Exam. This is an evaluation that includes some of the hardest questions you could imagine. Matter of fact, they even ran a competition (we highlighted that on the channel back in the day) where they asked people, "Give us your hardest problems, we'll include them in Humanity's Last Exam." So basically these are supposed to be the hardest expert-level questions that you can come up with. And if you look at the performance of different models on this particular benchmark, GPT-4o scores 3.
3%, something like DeepSeek R1 and o1 scored 9.4 and 99.