10 Actually Useful Things You Can Do with OpenAI Operator

4.27k views6855 WordsCopy TextShare
The AI Advantage
I’ve been testing OpenAI’s Operator (along with the entire AI Advantage team) pretty much non-stop e...
Video Transcript:
look at this worked for 15 minutes today we'll be taking a closer look at opena brand new product operator this is the very first agent that remote controls your browser that actually works and I've tested all of these okay this is obviously a very potent idea that hasn't found the right execution yet and I'm not saying that operator is the endol be all product it's a research preview but open I only showed off a few use cases that are very basic booking a table booking a hotel this thing can actually already do way more than
that and in this video I'm going to show you what possibilities me and my team found inside of open AI operator like researching multiple websites and putting all the data into a separate spreadsheet or doing the same thing and creating a custom PowerPoint presentation or transferring data from one source to another one like a Google sheet or notion and finally we'll even look at a use case where operator is using another AI to get results that it couldn't get anywhere else on the internet before we dive into this one by one I want to share
two things the first one is that I understand that this is inaccessible to most people as it is behind a $200 pay wall but that's why people like me are here that are going to test this out for you and show you what is possible I think this product category is extremely promising and this is the very first version of the product that I have tried where I feel like I will be using this regularly there are several use cases from this video already that I saved to my tasks and that I will be using
regularly moving forward is that worth the $200 spoiler for most people no probably not never less I do believe in this category and that brings me to my final point before we get into the actual use cases which is that I wanted to go further in this video than open a I did in its presentation nevertheless I hope you find the use case that might make your life easier and with that being said let's get into the first use case that is possible today with open eyes operator that you might not have seen yet all
right so one of the categories that a lot of people have been curious about is this use case of using operator to research a specific topic now whether that's going to different blog posts and pulling the data together or it's actually finding the websites that it should be looking at this General use case of putting a AI agent in front of a keyboard and mouse to gather some data from the Internet is just something that a lot of people would find useful so I came up with a few test cases to see how this will
perform in practice let me start with the first one which is kind of the simplest and then we can kind of work our way up so this one is based off the comment of Colin Bailey that is plain and simple he said just tell us how we can make money and from all my research I have not found a direct way to make money with this yet I mean I guess there's some indirect ways but what I did is I told it to research five viable online business opportunities with low startup costs and then I
also wanted to estimate it startup costs with potential monthly revenue and I just ran this and it worked for seven minutes right here first up it just found random articles of a Google search that asked for online business opportunities not the most sophisticated method to do this I suppose but that's what you can expect then it opened them up successfully navigated all the way through the various cookies like clicked away from the ads things like that scrolled through the whole thing as it Scrolls it uses its Vision recognition the one in GPD 40 to capture
the information on this site and then although it did find two different articles to do this it decided to just take the data from this one which by the way was 17 use cases and it presented it back to me in this format which is five use cases I mean it did the things that I wanted startup cost potential monthly Revenue but rather than looking at multiple sites it just looked at one and rather than giving me all 17 it just gave me five of these so it worked but I don't know that's uh two
out of 10 maybe a three out of 10 on execution of this I like the fact that it used llm to produce some of this extra data as this was obviously not a part of the article so that's nice and shows a lot of potential here but this simple query I mean not really what you want to see out of this by the way this was one of the prompts that I also ran through un Fric prompt improver to see if a more detailed prompt like this that tells it about the goals the requirements action
steps really breaks it down into the various components performs better now turns out that the result is pretty much the same without giving me monetary values on Startup cost potential Revenue Etc so it's actually performed a little worse one interesting thing that occurred here was something that I saw in multiple operations and I'm quite impressed with this because it's really good at navigating some of these sites in this case it went to the entrepreneur blog used the search function and when it got to a error page it managed to navigate back out of it and
get back into searching successfully and then again it pulled up multiple sources but the results all come from one source that it doesn't even site so again a lot of potential for improvement independently of the fact how well written your prompt is okay so then I took a different approach because I realized that hey it did what I told it to right it researched a topic it searched for it it found the site so here's the same task but approached from a different angle the initial prompt I gave it here is quite simple analyze and
execute the following task scrape the data from 10 different blog articles on how to use open AI operator so I wanted to research different use cases or articles or any details on operator that I might have missed and this resulted in the longest working time that I found look at this worked for 15 minutes because here I told it to actively not just search for multiple articles but also to scrape the data from them so this seemed to have worked really well let's review the video of the entire operation here because the result that you
can see here is pretty much a summary of what you would have got from the open AI block post but pulled from all of these different sources so it opens up One Source goes to the Bing search opens up the next link opens up the next link opens up the next link and so on I think you see the point it just opens up all these various sources and then when it looks through them it pulls the data together and this is sort of where the task ended then I followed up with another prompt which
basically said summarize all the use cases into a Google doc and then it just needed a quick Google doc signin from me and it went on to compile this Google document that I created did it by itself and pulled everything together now sure it's just a copy paste job of the results over here but I really like the fact that it can do this reliably it can work with Google Sheets Google Docs spin up new ones as you'll see in other examples it can transfer data and things like that really quite reliably and then if
we review the video here you'll even see that in the end it always goes to the share settings and it changes them to share this with anyone with the link copies the link and successfully includes it so this part really works for me and also this data that it's cved from the 10 different sites is actually a mixture of the various results here making this a little more advanced and specific use case a real success in my eyes I mean you could do this yourself right you could pull up 10 articles you could copy everything
you could run it through llm to summarize it but that takes a lot of time that is not just booking a table at a restaurant or booking a hotel or something rather simple like that heck even operator needed 15 minutes to complete this task and if you can find two or three of these that you will run maybe not every day maybe once a week let me tell you that's a lot of time you can save here but nevertheless seeing something like this for me is a serious use case why operator might add to your
productivity okay I was actually ready to call it a day and give up on this use case and just accept the fact that the summaries will be this concise but over the next 48 hours we actually return to this one because I thought it was particularly impressive how operator navigated the internet and then just pulled together all the data it's just that the very limited length of the output was really hamstringing this so we ran a session in our community trying this out and together with some of the community members we came up with this
idea of actually telling it to save a summary after each article to a new Google doc with a simple prompt like this and it actually did it check it out this is the result of the 15 minute long operation it looked at five different articles and it saved a summary of each one to here this is way better than summarizing five different blog posts into one paragraph now I have one paragraph per blog post much preferable we tried to push this further but sort of failed I want to show you anyway so we adjusted one
thing which was we told it for every free screenshots I want you to add a summary of the written content in that screenshot to a Google doc and it did do this but this was super messy I would not recommend it it did work so this is a work around on how to make lengthy summaries with operator but this was the document that created right there and as you can see every sentence just starts with deep SE R1 the topic that we're researching and it's just I don't know it's just not that good so obviously
these are limitations that with time will be loosened up and there will be competing products but for now I wouldn't recommend this work around but this one is really incredible and then as you might have noticed there's a separate part to the prompt here and I actually use this in multiple prompts now and I want to dedicate a separate section to this so let's talk about it because this command really allows it to do a task from A to Z without asking your permission multiple times so in our community I already wrote up a little
guide on how to exactly do this step by step but I can sumarize that back here to you because it's quite simple if you run operation for the first time you will encounter it asking for permission on multiple steps so for example on this summary of the different blog posts where it's supposed to add a summary to Google doc like so it always ask hey I did one to find more articles should I continue searching and then you have to come back and manually say yes this defeats the whole purpose of this application and in
other operations it asks multiple times can I rename this image can I download this file can I run another search so what do we found in an office hour session that I run in the community is that if you specifically tell it what not to ask you for or it won't do it anymore so if you just generally say something like only ask for my confirmation if you're spending money it will ignore that and it will ask you anyway but if you say that and then follow up with if you need to find more articles
or continue searching move forward then as you can see this conversation becomes one prompt one result exactly what you want from it so I started including this building block into multiple prompts now so the general workflow is just you run through the manual process of telling operator that it is allowed to continue and then you take these little Snippets and reintegrate them into this building block that allows it to proceed independently of what it thinks now obviously this is just one little prompting trick and many of these are already appearing and will keep appearing and
that's really what the community is here for on a video like this I can show you what we've done but I can't keep you up to date on every little trick that we find or that other community members find so yeah I just want to point out if you're into this type of stuff if you want to get the most out of it I'm going to make the community the best place on the internet to learn how to use open eye Operator just because I'm obsessed with this myself and I hope this little snippet helps
you out in executing your own operations seamlessly rather than having to confirm every little action okay let's move on to the next one but something like this is definitely going to find a spot in my task that will run regularly cuz having a research assistant that googles all the newest articles on any given topic and then summarizes it back to you in a dock like this maybe not worth $200 but in my books is a real use case that I will be returning to regularly next up is a very simple one and a lot of
people asked if it could handle file so I just gave it this little test of uploading a picture of Kiana Reeves I don't know I just had this laying around on my desktop as one does and I told it to post this picture to the chat GPT subreddit now it needed me to log in and then it asked if it should use a different subreddit again I could use the prompt snippet here before and if I customize it and include some of the wording in here so if I say feel free to post on a
different subreddit if this one isn't available or automatically log in for me with my Google account and then you give it the credentials it will not ask these follow-ups Point here being is that it actually success L navigated to the chat GP subreddit created a brand new post after I logged in and encountered a barrier you need some Karma on reddits to post on a subreddit so then it moved away and went to the open AI subreddit and uploaded the picture there now it didn't go through with posting it cuz I don't want to be
sharing pictures of K Reeves on that subreddit but as you can see the whole point here was that it has this file manager and you could for example give it multiple images and it will share them across the internet now obviously this could be used to spam Facebook groups LinkedIn Etc but this is nothing that spammers haven't been able to automate with traditional tools like a web driver the difference is now this becomes a whole lot more accessible and you can move around digital files like this without much friction you literally just upload it to
a new conversation tell it to post it to a specific site you can even give it the URL and then you can save the task and whenever you need to do that you can just let it do all the clicking for you a million ways to use this I just wanted to demonstrate that it can store and then use files for you if you add them as a attachment to a brand new conversation okay so let's look at some data transfer use cases because this was not just very requested but also very interesting right having
can find the data in one place and then put it into your database or Excel sheet in another place obviously something that a lot of people have to do manually all the time and in many of those use cases using something like traditional automation just doesn't make sense because setting up the automation would took like 5 hours to automate away a task that might take 5 minutes but quickly prompting operator with the links to the Google spreadsheets that you need to work with is something that takes a few seconds and in return it saves you
5 minutes and then every time you need to adjust something it takes a few seconds to adjust something saves you multiple minutes that does line up again even experienced person doing this with code will take five or 10 times amount the time of just doing the damn thing manually and this is where operator shines cuz it's so simple and it figures out all the details on its own so this is the example I ask it to copy data from a Google sheet to another spreadsheet and then it asked me hey do you have the links
to the sheets and I actually created two Google Sheets for it and gave these to them and in this interaction they have a few follow-up prompts as you will see in the video so so for example it just asked me for permission to paste the data and then I followed up and wanted to create a visualization and a forecast for the next 10 years and to place the charts in a new tab so what you can see on screen right now is the entire process and parts of this worked really well parts of this didn't
work at all so copy pasting the data flawless I actually ran a few more examples of this every time it came to copy pasting something from one source to another it does really really well at these tasks so I can see myself actually using that by the way this data I don't know I literally just said the name data one data to data free but as you can see it created the chart successfully but with the forecast when it came to that it started struggling a little bit cuz I wasn't specific enough I realized that
now in hindsight I realized that I just told it create a forecast for the next 10 years but I didn't tell it how I wanted to forecast it so obviously you should forecast the amounts but do I wanted to create a separate column for the forecast or do I want it to extend the amounts like I didn't tell it and one thing that I really learned with all my testing of this so far is operator does not like to ask follow-up questions like ever unless it's about permissions or performing an action and it's just wanting
your approval of its actions it will not ask hey should I do it in the same column or should I use a separate column something that a real person would probably ask it just does things just like chat GPT which great in many cases but in a case like this I'm not even sure what exactly it's doing at this point work with Excel formulas I think you could get it to work if you really know what you're doing but if you just blindly tell it things like hey forecast this data or create some visualizations it's
going to try but I think it needs a little more Direction here from somebody who actually knows what technique would be used here in Excel to forecast or I guess you could use chat gbt on the side and ask it what formula you could use and then give it the specific formula nevertheless in the end I had to correct it and tell it hey the forecasting thing didn't really work and it just figured it out it just went cell by cell and created all of this different data with a little hiccup here in the end
but what it did successfully is create two brand new tabs one of them with a graph of the current data and one of them with a graph of the forecast which I consider a ESS it created the graphs it transferred the data forecasting didn't work so well data transfer something that you can absolutely use this for now here's another example of that and this one works between different applications so in the AI Advantage we used notion Aton basically the entire operating system for both the community and our content production engine for the YouTube ibuilt and
notion and right now we have 16 team members testing various operator use cases inside of a notion database called operator use cases now Philip from the team actually went ahead and he used operator to updates the operator use cases database in our notion automatically isn't that amazing so he tested the use case of using notion and then operator entered all the details as you can see in the screen recording right now so this works like a charm and as you might be able to tell I'm really excited about this because we have so many manual
tasks in notion with the community and products we created for example with the very first course that we did two years ago we put together a library of over a th prompts and in notion we created card views for them so it looked like a catalog but each one of those needed a picture so we needed to manually head on over to M Journey generate an image then move it over to notion and there was no API for these card covers so we just had to manually pick the M Journey pictures and upload them and
repeat that a thousand times with operator this would be really simple because actually this is a finding from another team member mik he founds that the best way to work with notion databases is actually in the card view it's really good at this button like layout so for any of you notion nerds out there like me if you create card views for your databases operator is going to be able to interact with them most reliably as you can see here table works too but card views work even better these are just the little tips and
tricks that we're finding on a daily basis and then res sharing to the community and the YouTube here so look some of this stuff is pretty powerful but it's not that straightforward right you might have to include an extra line in your prompt or you might need somebody like me in this case to make you aware of the fact that a specific thing is even possible with this this really does feel like the early days of chat GPT or GPT 4 when it came out where there were thousands of unex opportunities and the little prompt
snippet could really influence the way you work but I feel like here it's even more impactful because this thing actually does stuff doesn't just tell you how to do things and what I decided to do is to redirect our entire AI Advantage community in this direction I've always been saying that these assistants are just a precursor to agents and now that we have the first proper agentic product from one of the biggest players we're doing something we're calling operator February so we basically pivoted most of our events all of our guide production all the community
discussions to cover operator and to help anybody who's a member of the community get the most out of this product with a dedicated space where you can discuss different use cases and me and my team are sharing them just like I'm in this video in that space as we discover them plus there's discussions amongst members erupting underneath pinpointing different possibilities or routes that can be taken with an operator I really believe that this is the next evolution of AI and that here it starts getting so useful that this product will eventually get so useful that
the mainstream will not be able to ignore this anymore just like the chat GPT moment or now the Deep seek moment recently this will have its moment too the difference is this is not as intuitive as you need to know what it can do what are the limitations how do you get around those and my goal will be to make the AI Advantage Community the best place in the internet to educate you about that obviously a lot of it is also going to trickle down to the YouTube channel just like this video but I just
wanted to let you know that if you share my vision for this and you want to get as good as possible ad operator join me and all the other members in the AI Advantage community and let's explore the possibilities of new tools like this together all right that's my little Community promo that I really wanted to do this because it's it's going to be all about operator and similar products moving forward and now let's look at the next AI use case that you can do inside of operator today okay here's another one that I really
liked and this is really impressive because it was a one-hot prompt and it actually did all of these things in sequence so the prompt says compile a competitor analysis for the top five subscription based e-commerce Platforms in the pet supplies industry compare their pricing marketing strategies and customer reviews then create a concise five slide presentation in Google slide summarizing your findings and share it with insert GMA email address and let me tell you this one worked like a charm it found five different subscription-based pet supplies companies it looked at reviews and looked through their websites
for all the information we asked for and then it went to Google slides to create a brand new PowerPoint presentation so to say and look at this thing operating away I mean isn't this impressive you can get drafts of presentations on any Topic in the click of a button if you have the right prompt preset this goes Way Beyond the power level of even the best prompt inside of I mean come on this is amazing there's no denying that like this is where all the arguments like oh it can only book a table or a
flight I can do that myself go out of the window like come on five slide presentation from one prompt not bad we're just getting started here so yeah this works well and this is another one of those presets that I'll just have in my tasks and whenever I need new research on something this is a great way to do that just research a niche and create a presentation from it and literally one click and here's another one like that it's really good at these research tasks if you give it some source to store the data
so this one is also from philli and we'll put all the prompts here in the description below but basically he told it he was looking for affordable pocket camera and he wanted product link pros and cons and the pricing basically doing some rudimentary product research and as you can see it found all the different ones and look at it creating this spreadsheet ah isn't this just wonderful there you go that's the first one and we can just fast forward for all of this right no need to sit here for 3 minutes but it goes ahead
and successfully creates a spreadsheet with all the different links Pros cons the pricing and a personal rating that operator AK gbd z40 came up with all in one shot I want to add one note here actually Philip included a part of this prompt in the end he told you to think can to create a plan before it went ahead with the research it just completely ignored this this is just gbt 40 and it's just going to go step by step and do its thing I think sooner than later we're going to see operator with 01
or 03 or O3 Pro or competing products from Deep seek I guess come out and those will be a lot more calculated in its approach but it's incredible to see this just working with a onot prompt and the nice thing is you can run this promp multiple times and it works every time it's not like some of the competing products where it's just not reliable okay here's another quick one and this one is about sending messages to slack I wanted to highlight this one because it really shows how good operator is at solving some of
these problems I just want to be clear like it's not a perfect product it's a research preview and sometimes it does get stuck in a loop but usually with some clever prompting you can work around that and make it work and then you can save that clever prompting to the original task and have a preset that actually works so that's incredible in this case it's trying to use slack and it doesn't work as you can see it's stuck in this Loop where it keeps telling it that hey this site is blocked and it's not going
to work it does that a few times in a row but then it chooses another approach I'm not sure how it found this out but it just goes to app. slack.com then it tries the wrong approach a few more times but then it returns to app. slack.com and realizes that hey this works there's a web version of this application and I can just send my message like so and Philips operator successfully sends a gbd4 written message to our team like which is an amazing demo of how it just finds different ways of getting around problems
and this is just consistent amongst all the use cases that we looked at nice so you can use operator to spam your colleagues or loved ones on autopilot that makes me think that I might want to try out a use case where I do something like Gil foils AI for any Silicon Valley fans out there he had this bot one day where he was talking to its co-workers and it wasn't even him H after recording this video I might want to see if I can replicate that with operator okay here's another interesting use case that
was suggested by YouTube user with William Embry 57 y7 and and he asked if this could draw using any webbased cat programs this is basically a version of a 3D software that can run in your browser so this was another one of the examples where I approached it with a simple prompt and a more complex prompt the simple prompt basically said access a popular webbased CAD or CAD program and create a simple 3D model of a basic geometric shape document the process and capabilities all right so it went in and it basically only needed my
Google login at a certain point to create a new account on some of the software that it found but I can already tell you that this task did not work at all it actually found a bunch of these like Tinker cadens ketchup web vect Trace bline e- machine shop which by the way I was really impressed with it even went to some GitHub repost to find open- source cat software like this and while on some of them it managed to log in and others it needed my help none of these worked because all of them
require Hardware acceleration in other words they need help from your local computer because just the browser cannot run these by itself so as you can see on the screen recording here it it always gets to the point where it opens up the software and then some error message appears that hey you need Hardware acceleration for this to work and because this is just a simple virtual machine with very little compute backing it up and not a full-fledged computer none of these web applications worked nevertheless this was a really impressive one to test as it did
find a pleora of different applications I mean look at that and it figures out the interface of each one of them by itself I think on all the ones except the GitHub it managed to get to the user interface and then it basically refused to work on this one called figuro it actually clicked the different buttons and it tries going into the canvas and drawing something up and it refuses to coperate on this one I'm actually not sure if it's because of the hardware acceleration or it's because the software is used differently but nevertheless it
gave it a good shot now we're in the same task through the anthropic prompt improver which always does a good job with fleshing out the task into multiple steps Etc as you can see right here so it's just running the simple prompt through the prompt improver to create this this and it tells you the 12 steps to the automation including sub points and everything so in theory this should work better well that's at least what I thought but yet again this really elaborate prompt giving it all the details performed equally as well as the short
one matter of fact it found the same list of different tools worked for a total of 8 minutes whereas the other one worked for a total of 12 minutes if you just quickly scroll over the video it's pretty much the same thing it opens up the different pieces of software runs into error messages just to eventually give up on this task so anything that might need Hardware acceleration is not going to work on this because it's just a simple virtual machine where it barely has enough Hardware to run a browser anything like a 3D software
is not going to work on that okay and to round this video out I want to give you one last example and this one I picked as the very last one because it just clearly demonstrates the potential here that we haven't even talked about here the point of this video is showing you use cases that either work or break so you get a feeling for what it can do but I think the biggest potential here is operator actually working with other agentic tools whether it's other AI platforms like Claude or M journey in this case
or even beyond that we're testing use cases where it's using repet agent to build an entire website or where it even assembles its own automations these apps work CU they don't need Hardware acceleration like the 3D modeling software so in this case Dom which is responsible for the creative District in our community ran a bunch of mid Journey prompts that included in the original prompt automatically with operator he basically h a bunch of variations of the same prompt with different srf codes you just wanted to see what all the different styles would look like on
it and rather than sitting there and having to send it all manually Operator just did it all for him again this is a one-hot prompt without any follow-ups and it just did the damn thing and then you could keep playing with the prompt and download these images to the local file storage and then reupload them somewhere else although I have to say I tried this particular workflow generating an image saving it and posting it somewhere else the saving part sometimes is a bit tricky it's really good at uploading images but the saving step is not
as good as I would like it to be yet so what's the conclusion here well in my humble opinion after a few days of really intensively playing with this and kicking the wheels this thing has impressed me as opposed to many other AI releases which are very hyped when they come out and then very often they disappoint in practice this one was the opposite it actually came out and the whole deep seek story just overshadowed the release of this but in my opinion it is the most capable web navigating agent I've seen yet and I
think I tried them all let's see I'll run some comparisons again maybe I'll create a separate video if that's worth it but the point is it's just the best product on the market right now and I personally think that some of the use cases in this video are ones that I would actually find myself doing like I purposefully left out some things that I'm doing with it already and that I love like for example this one that orders a basic basket of groceries from a favorite Supermarket I run this once a month to get the
basics like mozzarella and bread that I like because those are things that people expect I tried to show you some things today that may go beyond what they showed off in their blog post but overall I would say it's been less than a week since this came out I would not want to give this product up anymore is it worth $200 hm I guess that one is really relative because if you're making $50 an hour then I do think that this can save you 4 hours a month you might have to play around a little
bit create your own presets but that seems realistic to me plus there's the upside of building skills that will transfer into the future other than that it's probably overpriced for most people at this point but I myself will stay on this journey on this course of trying to squeeze the most out of this and this is the first version of it where I'll be actually using this on a regular basis like why would I order my groceries manually if I have my wonderful preset here like I literally pressed this button yesterday and this cheese that
I really like just magically appeared at my door in around 1 minutes so if you think this could be useful to you and you want to learn more about this then I can only recommend you check out the AI Advantage community and we're also running weekly sessions where a team member of ours is sitting there with a copy of operator and if you want to run anything through it you can just leave a comment in advance or join that live event and our team will help you run some operations that might be relevant to you
all right that's that's all I got for today if we find more I'll be following up with more videos on operator I honestly think this is the main thing I want to be covering on this channel and with the advantage moving forward the future is here and I will see you very soon
Copyright © 2025. Made with ♥ in London by YTScribe.com