NVIDIA’s New AI Is Really Good At Moving Rabbits!

1.95k views784 WordsCopy TextShare
Two Minute Papers
❤️ Check out Lambda here and sign up for their GPU Cloud: https://lambdalabs.com/paper 📝 The paper...
Video Transcript:
If we have photo that is almost perfect,  but we wish to delete something, image inpainting techniques already exist  to help us with that. Like this. However, what if we don’t want to remove this dog,  only place it somewhere else?
Of course, that is still trouble. Look, we could inpaint the  part that we deleted, but the new part of course, does not look convincing at all. So, is that  impossible?
Nope, this new AI paper promises that it can do exactly that. I would very  much like to see that. Let’s have a look.
Dear Fellow Scholars, this is Two Minute  Papers with Dr Károly Zsolnai-Fehér. I am especially interested in this because  previous techniques could perform it, but…ouch. Not great.
When we wish to  move this puppy, they don’t appear to understand the relationship between  this good boy and its reflection. So, little doggie, time to move! And now, the new technique…goodness.
This is fantastic.  If I was given these images, I would be hard pressed to say whether they were changed.  Even the reflection moved to its new place, great job, maybe if you look very closely at  the splash, you might find out about the trick.
This is a wonderful collaboration between several  institutes that you see here. And I wonder what the key idea is here? The key is that we  have diffusion based text to image models, these can generate images for us, that is great,  but not all of them give us fine grained control.
Some of the ones in newer papers, however,  can do that. We can highlight regions with these blobs and say I would like to see a cat  here, a rock and a cloud there. Loving it.
So, our problem is simple, just move the blob,  right? Well, let’s see…nope, not quite. Fellow Scholars, do you see the problem here?
The  problem is that not only the blob changes, the whole image changes. That is too intrusive. So, this amazing new paper found something really interesting.
Let’s use a previous technique  and ask for a rabbit, and a cat. There we go, we got the rabbit, and we got a…well, that is  many things, but a cat it is not. Perhaps a hybrid of a rabbit and a cat.
And oh yes, therein  lies the problem. The information of the rabbit blob leaked into cat blob, and the cat has been  rabbitified. So, what does the new technique do?
Now hold on to your papers Fellow Scholars  and let’s ask for one rabbit and one cat, and, there we go. Less leakage. Finally,  the blobs are now independent.
Loving it. And that is one of the key ideas that helps  us move these objects in an already existing photo. Luckily, there are lots of examples  shown in the paper.
And I really like how it understands that some of the surroundings  have to change when moving the object, for instance, the shadow has to go too,  but everything else has to remain the same. It is a really tricky problem, and  this is a fantastic leap forward. Is it perfect?
No. Clearly not perfect. But when  showing the results to a bunch of humans, the new technique has a significantly  higher win rate against previous methods.
And in some cases, you can also move  not just the cat, but the rock too. Two objects. I can imagine a future paper where  every single object is recognized in the images, we can already do that quite reliably,  but then, also moving would be great.
You know what else would be great? Rotations!  Let’s see if it is any good at it.
Well, unfortunately it is not too good at that. So  what about resizing? Also not the best.
Small adjustments are kind of okay, but if we go  bigger, these artifacts appear. And finally, the most hilarious. When two objects  are moved to close to each other, what happens?
Oh my, the dog just  absorbed that piglet. What a pity. So, not perfect.
The paper also contains a ton  more comparisons against previous techniques, it is super fun, make sure to  check it out in the description. It works on a variety of images, that is  always a good sign that something would work well in practice, and just imagine  what we will be capable of two more papers down the line. I would like to make a  prediction for that: as you move the puppy, you will see real time updates.
Goodness, that  would be fantastic. What a time to be alive! So, what do you think?
What  would you Fellow Scholars use this for? Let me know in the comments below.
Related Videos
OpenAI o1: ChatGPT Supercharged!
7:13
OpenAI o1: ChatGPT Supercharged!
Two Minute Papers
98,961 views
DeepMind AlphaProteo AI: A Gift To Humanity! 🧬
5:55
DeepMind AlphaProteo AI: A Gift To Humanit...
Two Minute Papers
129,718 views
New AI Makes The Mona Lisa Come Alive!
5:38
New AI Makes The Mona Lisa Come Alive!
Two Minute Papers
46,358 views
NVIDIA’s New AI Did The Impossible!
9:26
NVIDIA’s New AI Did The Impossible!
Two Minute Papers
317,361 views
OpenAI’s ChatGPT Is Now Learning From Another AI!
6:08
OpenAI’s ChatGPT Is Now Learning From Anot...
Two Minute Papers
109,303 views
Blender 4.2 Is Here - A Revolution…For Free!
5:46
Blender 4.2 Is Here - A Revolution…For Free!
Two Minute Papers
221,664 views
OpenAI's ChatGPT Does Research… And Breaks Itself!
5:55
OpenAI's ChatGPT Does Research… And Breaks...
Two Minute Papers
110,278 views
Wow, World-Class AI For Free, For Everyone!
6:45
Wow, World-Class AI For Free, For Everyone!
Two Minute Papers
71,044 views
NVIDIA’s Tech: Finally, Real Time Ray Tracing! (Episode 900 Special!)
6:19
NVIDIA’s Tech: Finally, Real Time Ray Trac...
Two Minute Papers
133,414 views
OpenAI’s New AI: Being Smart Is Overrated!
5:49
OpenAI’s New AI: Being Smart Is Overrated!
Two Minute Papers
81,391 views
OpenAI’s DALL-E 3-Like AI For Free, Forever!
3:47
OpenAI’s DALL-E 3-Like AI For Free, Forever!
Two Minute Papers
79,443 views
Introducing Raspberry, an Open Source attempt to recreate Strawberry
29:12
Introducing Raspberry, an Open Source atte...
David Shapiro
29,333 views
NVIDIA’s New Tech Runs A Virtual City!
6:23
NVIDIA’s New Tech Runs A Virtual City!
Two Minute Papers
157,071 views
BYD destroyed the hybrid competition with Seal 07 DM-i!
8:31
BYD destroyed the hybrid competition with ...
MAD EV
7,294 views
DeepMind’s New AI Looked At 100,000,000 Examples!
5:02
DeepMind’s New AI Looked At 100,000,000 Ex...
Two Minute Papers
57,156 views
OpenAI Sora: Alternatives To Try Right Now!
3:55
OpenAI Sora: Alternatives To Try Right Now!
Two Minute Papers
30,404 views
Crazy New AI Learned To Rewrite Doom!
4:17
Crazy New AI Learned To Rewrite Doom!
Two Minute Papers
47,748 views
New AI: This Is A Gaming Revolution!
5:20
New AI: This Is A Gaming Revolution!
Two Minute Papers
106,908 views
Day9 : SQL Database Training: Clauses in SQL | Where | Order By |Aggregate Function | Group By
28:00
Day9 : SQL Database Training: Clauses in S...
InventModel
20 views
Adobe Illustrator Design || Create E 3D logo design
10:38
Adobe Illustrator Design || Create E 3D lo...
Classic TechArt
2 views
Copyright © 2024. Made with ♥ in London by YTScribe.com