hey guys it's Sasha let's dive straight into this one open AI says it has evidenced China's deep seek used its model to train competitor open AI has just gone to the financial times and given them this tiny little bit of an exclusive interview saying that they are upset that deep seek the Chinese AI startup has apparently used open ai's data to train their model and I have a few Choice words that I want to share some unfiltered opinions about open Ai and about every other big tech company who does the same stuff and if you're
easily offended or there are kids nearby then maybe watch this video after the kids have gone to bed so in this breaking news story the San Francisco based chat GPT maker told the financial times it had seen some evidence of distillation which it suspects to be from Deep seek the technique is used by developers to obtain better performance on smaller models by using outputs from larger more capable ones allowing them to achieve similar results on specific tasks at a much lower cost the issue is when you take it out of the platform and are doing
it to create your own model for your own purposes said one person close to open AI so open AI are very annoyed it appears because somebody had the audacity to go and train their model by essentially taking data from open AI apparently this method of distillation is essentially just making your own model go to talk to chat GPT ask chat GPT millions and millions of questions to train the model itself right rather than going to all these websites C cating all the information just go and ask chat GPT for all the answers open aai declined
to comment further or provide details of its evidence its terms of service state users cannot copy any of its services or use output to develop models that compete with open AI you can go and read the rest of this article in the financial times the article and the rest of the thing is behind the pay wall I personally do pay for the subscription it's up to you but here is the thing openai is saying that it is against their terms and conditions against their terms of service for other companies to come and use their content
to make their own models for their own commercial gain well I never I have some big [ __ ] news for you Mr samman every other website in the world also has the same exact Clauses in their terms and conditions you know the websites that you scraped every other website in the world is also protected by copyright they also say that you cannot just take their data and do whatever you want with it for commercial purposes and for some weird reason that did not stop open AI from coming and taking all of that information which
is expressly not allowed to be used for commercial purposes taking that information to build their large Lang language models and then make loads and loads and loads of money billions upon billions of dollars in value of the company so open AI built their entire business their entire product on the back of stealing everybody else's data literally the exact thing that they've done if you went and did hours days weeks of research put valuable information on your website you know loads of stuff from your own personal experience open AI would just turn up come along scre
scpe the whole website absorb all of that into their training data set and then sell the regurgitated version with all of your research to their customers and without stealing all of this data it would be impossible for open AI to ever build their product because they would have had to pay trillions of dollars to get access to the information in the first place Sam ultman basically acknowledged this many times in interviews in the past and last year satian Adella the CEO of Microsoft who owns 50% of open AI said that anything that can be accessed
on the open web is fair gain he said that if information is available on your website then it is in the public domain apparently according to Tech Bros of course that is not true it is complete horseshit and you are not allowed to just randomly turn up to a website copy all of their stuff to then go and use it for your own commercial purposes to make money on it that's illegal every developed country has very strict copyright laws that expressly say the exact opposite to what people like satian Adella are saying but Sati Adella
is a chump and he only cares about enriching himself and his buddies and the shareholders of Microsoft so he thinks it's perfectly fine to go and steal other people's data if it helps you to get richer except it's not fine when it happens in Reverse it's not fine when other people do it to you apparently over the weekend deep seek released their model and according to deep seek they trained their whole model on a fraction of the hardware that open AI users and for just $6 million which is orders of magnitude less than what it
cost open AI to train their models apparently this R1 model by Deep seek is about as good as the best O Series models by open Ai and open AI is now crying and complaining loudly and all their friends on social media complaining loudly because they are saying that after they spent all of that money to build chat GPT the Chinese just came and took their data and built their own product and they're selling it and it's much cheaper oh no I tell you what I love a good case of [ __ ] around and find
out what do you think did all the Publishers do before you came and stole their data in the last year Google has destroyed hundreds of thousands of Publishers by stopping sending them traffic and instead showing preferred websites websites that they have commercial deals with and their own AI generated answers trained on the data of the websites that they have since destroyed those websites also collectively spend huge amounts of money spent huge amounts of effort built loads of personal experience years of time put into colleting and creating that information and now that the information has been
taken there is no longer any need for those websites so Google has basically deleted all those websites off the face of the Earth and present their own product instead hundreds of thousands of small and mediumsized online businesses no longer exist as as far as Google's concerned as far as Google's concerned from March last year those websites have just been wiped they're getting no traffic at all instead of sending you to somebody else's website Google can keep you on their own platform and show you more ads this way they make more money an open AI did
exactly the same thing so when open aai Microsoft Google all these other people steal everybody's data brazenly then it's okay it's fair game that data is in the public domain because you can access it in new website but when everybody else does it to open AI When anybody else does it to open AI it is really bad evil thing to do how dare they let me get the tiniest of tiny violins and play a very little sad song while the executives at open AI cry about this one I am sorry but I have zero sympathy
I just do not care one iota for the dweebs at open Ai and every other company that is doing this with their large language models you live by the sword you die by The Sword and I hope you die very very quickly by that sword if data that can be accessed on a website can be used by you to make money without paying the people from whom you stole it well then the same exact thing can work against you too and it should it should and I really hope it continues to in fact deep seek
probably paid you for the data by buying a boatload of tokens most of the website users whose data was scraped did not get paid any anything not one cent unless those people are deep seek maybe manag to create Millions upon millions of free accounts and then use the free quotas I really hope it is that latter bit I really hope they paid open AI absolutely nothing because that would just make this even sweeter it would make it way better the guys who plagiarized and stole from everybody else on the whole internet are now complaining that
somebody else is stealing their stolen stuff from them it is like running to the cops and saying that somebody just stole the car that you stole from somebody else last night the funniest thing is that open Ai and the rest of the llm space has this whole Army of tech bro Fanboys defending everything that they do it is just nauseating when somebody complains about the fact that open AI stole their data these Tech bro dewis will come out of their dark shadowy corners and say that this is how the internet works man copyright doesn't apply
it's just so Antiquated your terms and conditions don't apply the fact that you do not consent for Sam Alman to turn up and steal your information and do whatever he wants with it to become a multi-billionaire that's irrelevant and if you disagree is because you're sour is because you're a dinosaur is because you don't get it well who's sour now how does it feel when the shoe is on the other foot how does it feel when somebody comes and says [ __ ] you to your copyright [ __ ] you to everything that you've worked
for years to produce and makes a cheap cop to replace you and destroys your business in the process oh does that not feel so nice oh no what a shame what was that you want your copyright protections you want the law to punish people who break your terms and conditions you want those people to go to prison you want to be fairly compensated for your work is that what you want Sam the word that you're looking for is karma karma has turned up and dished out a giant can of whoopass and smacked you square in
that smug face of yours I hope that another dozen Chinese tech companies come and do the same exact thing and devalue these llm grifters all the way to zero you went straight up the line of fafo and you are now beginning to find out the fact that governments around the world in the United States in the UK elsewhere are in cahoots with these Tech Bros is actually kind of sad the US is busy announcing project Starcraft or Stargate or starfish or whatever it is the UK government is apparently considering changing hundreds of years of copyright
law to allow AI companies to come and steal other people's proprietary data because it's the future bro and I'm sure all of this is not in any way whatsoever linked to gross corruption greed and personal interest now let me get my extra large popcorn and wait for the inevitable Tech Bros crying in the comments see you guys later