Nvidia Research introduces Difuhaul, an AI tool that allows objects to reposition images

ByTech Word News February 27, 2025 1:15 pm

google, search, ai, artificial intelligence, ai assistant, chatgpt, deepseek, gemini, claude, ai bot, gadgets,

NVIDIA researchers on Monday launched a new artificial intelligence (AI) model that can reposition objects in images. The tool, called Difference, allows spatial understanding of the context of an image to move an object from one position to another without affecting the background or shape of the image. The unique aspect of the technique is that it is trained without training, which means that this tool is not built using pre-trained data. The new technology was demonstrated by the company at the Asia 2024 Conference, the Special Interest Group on Computer Graphics and Interactive Technologies.

In the research paper, NVIDIA researchers detailed the new AI tools. The technology was developed in collaboration with Hebrew University, Tel Aviv University and Richmann University. With the help of new tools, researchers aim to solve a prominent problem through AI image generation models, namely the problem of relocating objects with spatial awareness.

The paper emphasizes that this particular editing task remains a bottleneck for AI scientists due to the lack of AI models for spatial reasoning. Existing visual models can understand the context of the image, but because the object does not understand how to sense motion in a 2D environment in space, it is impossible to move the object.

With the difference, NVIDIA claims to fix this problem. Based on the image diffusion architecture, the tool uses attention masking in the DeNoising step. This is done to preserve the high-level object appearance. AI tools use Blobgen, a new technology that integrates spatial understanding into AI tools. In addition, real images with local models were reconstructed at designated locations using new techniques.

On the front end, users will be able to type text prompts highlighting the object they want to change, and the AI can space re-tune the object when the background is adjusted accordingly. In the demo displayed by the company, it is not possible to determine whether the AI editing tool can understand the shape changes caused by spatial motion. For example, if an airborne balloon is moved to the ground, its shape will also change. However, due to lack of training, AI may not be able to capture it.

Tech News

Confusedly select Comet web browser and search through proxy to open the waitlist
ByTech Word News February 25, 2025 10:27 pm

On Monday, confused by a new web browser with artificial intelligence (AI) capabilities. Known as Comet, the browser is mocked to include a feature called…
Tech News

Openai reportedly plans to manufacture its first internal AI chipset
ByTech Word News February 11, 2025 10:19 am

Openai reportedly plans to manufacture its first customized artificial intelligence (AI) chipset this year. According to the report, the San Francisco-based AI company has begun…
Tech News

Italy’s regulators block Chinese artificial intelligence application DeepSeek about data protection
ByTech Word News February 8, 2025 1:18 am

Italy’s data protection agency Garante said on Thursday that it has ordered its chatbots in the country after Chinese artificial intelligence startups failed to address…
Tech News

Amazon Great Republic Sales 2025: Best deals on the soundbar
ByTech Word News February 13, 2025 8:25 am

Amazon Great Republican Sales 2025 is the first sales event of the e-commerce giant this year. It started on January 13 and ended on January…
Tech News

Openai and Microsoft reportedly have a weird AGI business metric
ByTech Word News February 20, 2025 4:53 am

Openai and Microsoft reportedly have unique definitions of artificial universal intelligence or AGI. According to the report, the two entities added AGI’s business metrics when…
Tech News

Openai Sora AI video generation model is launched; now available for paid subscribers
ByTech Word News February 26, 2025 2:47 am

Openai finally launched its artificial intelligence (AI) video generation model Sora on Monday. In February, the company previewed Sora’s choice of individuals, and now, it…

Similar Posts