this week was truly a great week for the AI space as we had crazy new releases from anthropic which had just released a few new models and dropped their new computer use API where it essentially directs Claud to use computers the way people do by looking at a screen moving a cursor clicking buttons and typing text it is powered by Sony 3.5 and it's capable of automating various tasks I actually made a video showcasing all the ins and out of the computer used API as well as showcasing how you can install it locally so if
you're interested take a look at the video link in the description below but now there's a new way to integrate Cloud's computer used API into a proficient framework that we had showcased on this channel long time ago called open interpreter where it can practically control your computer in this demo video you can see that it's capable of reading and editing files having the ability to search the web and so much more this is the capability of having the computer used API linked with the framework like open interpreter now particularly in this demo it is applying
for a job within anthropics job board and you can see that it's going to be capable of taking screenshots of the current standpoint of the web page and it will then take the steps to fill out the like Fields like your first name last name your email address as well as your phone number based off of the resume that you had provided to open interpreter so in this case you can see that it is retrieving it within the terminal and it is slowly but surely filling out all the different fields for the people who do
not know open interpreter is a framework that lets LMS run code like python JavaScript shell and many other programming languages locally you can chat with open interpreter through this chat gbt like interface in your terminal which lets you create various sorts of files editing photos videos PDFs you can control a Chrome browser to perform research plot clean analyze large data sets and so much more now I've made a video on this so I definitely recommend that you take a look at it cuz it'll give you a better idea as to what you can do with
open interpreter so let's get started and showcase how you can install open interpreter and utilize the computer use API locally what you'll need to do is fulfill the prerequisites make sure that you have python installed as your programming language you'll need get to clone the repository onto your desktop vs code to configure API keys and lastly you need to make sure that you have an anthropic API key also please note this is only available for Windows as well as Mac OS currently not available for Linux so just keep that in mind so what you want
to do is open up your command prompt once you have opened it up click on the GitHub repository click on this green button click on copy the clipboard for the link of this repository and you can scroll down to this part of the repository now go back into your command prompt and type in get clone and then paste in the link for this repository you can then click enter this will start cloning this repository and once that's finished installing we can then go into the folder and then install the packages once you have finished cloning
you can then go into the directory by typing in CD open- interpreter once you're inside The Interpreter folder you can then install the packages so copy this and then paste this in this will start installing everything that is necessary once this is done we can then either Open up The Interpreter by using this command to access the ability uh abilities over here but in this case what we're doing is utilizing the OS computer used API so we would simply want to copy this command now there is different functionalities and different codes that you can use
to utilize different features OS mode is what's going to enable us to use a computer used API so once it has finished installing we can then paste this in now unfortunately if you're on Windows you might get this error over here and I currently have this error and I've tried my best to figure out a solution but at the moment I don't have the time as well as ability to fix this so you might have to wait a bit until Killian with a Creator behind this project Updates this package so that it is operational for
Windows now he is trying his best to find a fix and I'll definitely keep you guys posted but for the people who want to keep on continuing with the installation what you can do is just simply copy The Interpreter --os command and then you can just simply paste it into your command prompt and it should work right away it's going to then prompt you to provide an anthropic API which you can provide by generating an API key linked to a bilding account and then you can start accessing and utilizing this computer use API this is
probably one of the best examples of open interpreter now since I cannot actually showcase my personal demo here is a demo video where it is basically prompted to download a song that I found on Google and the person behind the computer is trying to transform it into an MP3 so you can see that it's taking the steps to download the YouTube video it's going to then convert the downloaded video into the MP3 format using ffmpeg now as you can see it's executing all of these commands on its own autonomously and as we see it go
further into the video it's going to then proceed to download the video for you so it looks like it has finished downloading the video and once it has finished downloading the YouTube video it had then converted it using ffmpeg so once it has now converted it it installed the new MP3 file and now as we go further into the video it'll showcase the new converted MP3 file and this is all being executed on its own this is the capability of this beautiful new computer use API being integrated within open inter RoR what people have been
capable of doing with the computer used API is just insane cuz people can utilize it for data Gathering where you can use the agent to navigate through websites to collect relevant information as to how you would with fir craw and this could even include data about products Services team members or even contact information and you can request it to take that data and then paste it into a file which is the capability that you get with the computer used API you have the ability to also have application process where it can then fill out as
well as intake forms and this would involve automatically inputting information that is collected from the website and possibly even generating responses to questions based off of what it is basically asking for this is the capability of what this real-time data extraction and automation process that you get from open interpreter having it linked with the computer your API also I haven't really done a video on interpreter in a long time but you have the capability of utilizing this terminal based assistant to basically execute various coding based tasks so in a way this is kind of like
AER but a little bit more focused on general purpose use cases so you can utilize this to execute various tasks and have it so that it could generate various components for you at the end of the day with the ability to connect the computer us API with open interpreter you get a more Dynamic use case as you have the functionality to utilize all the features that you would get with open interpreter with the computer use API now there's a lot more to this so I definitely recommend that you take a look at their documentation to
Showcase all the features associated with this framework now with that thought guys I hope you enjoyed today's video and you got some sort of value out of it I'm sorry for the error that I had basically encountered I will definitely keep you guys updated on a soltion if you're on Windows but with that thought guys I'll leave all the links in the description below make sure you follow me on the patreon so that you can access different subscriptions the AI tools completely for free make sure you follow me on Twitter a great way for you
to stay up to date with the latest AI news and lastly make sure you guys subscribe turn on notification Bell like this video and check out our previous vide so you can stay up to date with whatever is happening in the world of AI but with that thought guys have an amazing day spread positivity and I'll see you guys fairly shortly peace out fell