Yesterday, Nvidia Research published a blog post highlighting numerous advances in AI research throughout 2024. Some of these advances are very typical generative AI achievements. Still, there are things a little more interesting than simply generating content using someone else’s copyrighted material. For example, StormCast and MaskedMimic support more advanced weather forecasting and motion repair (reconstructing complete motion from partially visible motion). respectively for robotics.
There is also GluFormer, which uses AI to predict blood sugar levels up to four years in advance, but this requires past blood sugar monitoring data. GluFormer also makes it easier to determine how dietary changes affect long-term blood sugar levels, and studies have shown high accuracy for people with conditions up to and including diabetes. I am.
Some of the other touted improvements are a bit less impressive than in other fields of generative AI, and some are ethically questionable. However, they still represent the evolution of technology. “ConsiStory” enables multiple AI image prompts with a consistent theme, increasing utility for users looking to create narrative coherence with these tools.
watch on
On the other hand, “Edify3D” and “LATTE3D” are generation AI tools that can easily create 3D models. Existing 3D modelers don’t really like this. But they point out that AI retopology and UV mapping will be very useful for existing 3D art workloads without taking away all the fun and paid effort of creating the models yourself. There’s also “Fugatto,” a generative AI model for creating new sound files (including music) or modifying existing sound files (such as removing background music).
Finally, towards the end of the Highlights blog post, Nvidia Research summarizes a number of improvements and benchmark wins. These wins include Hydra-MDP, a self-driving framework that won CVPR 2024’s Autonomous Grand Challenge, Nvidia Blackwell’s superior performance in the MLPerf industry benchmark, and first place on the BOP leaderboard for model-based poses. Includes “Foundation Pose”. Invisible object estimation.
Nvidia links to related research papers for a more detailed overview of these advances in the original blog post.