2024 is shaping up to be a pivotal year for advancements in artificial intelligence (AI) and robotics. Major tech companies and research institutions have made significant strides in these fields, introducing innovative technologies that promise to revolutionize various sectors. This article provides a comprehensive overview of these developments.
Figure’s Autonomous Learning
Figure, a leading robotics company, has made a significant breakthrough with its Figure-01 AI. This system can learn to self-correct tasks by observing humans perform them. For instance, it can learn to make coffee by watching humans do so, marking a significant step towards end-to-end AI training.
Google DeepMind’s Mobile Aloha and Advanced Robotics Research
Google DeepMind, in collaboration with Stanford researchers, introduced Mobile Aloha, an open-source robotic system capable of completing complex tasks such as cooking and cleaning. This system is a testament to the potential of accessible and reproducible solutions for bimanual mobile manipulation.
In addition to Mobile Aloha, Google DeepMind has made significant strides in robotics research, introducing AutoRT for data collection, SARA-RT for faster transformers, and RT-Trajectory for better motion generalization. These advancements promise a future where robots can perform a wide array of tasks, demonstrating strides towards more capable helper robots.
DeWave: Translating Thoughts into Text
Researchers have developed DeWave, an AI system capable of turning silent thoughts into text by decoding brain signals. Remarkably, this system achieved over 40% accuracy in translating verbs directly from neural signals, without the need for invasive implants.
Apple’s Ferret and Rumored Siri Upgrade
Apple researchers have revealed ‘Ferret’, an open-source multimodal large language model (LLM) that can use regions of images for queries. Coupled with rumors of an upgraded version of Siri, Apple is poised to make significant strides in the AI race.
Alibaba’s Lifelike AI Avatar-Maker
Alibaba researchers have released a lifelike AI avatar-maker called ‘Mach’. This AI system can turn text prompts into realistic 3D avatars, using LLM and vision models.
UCLA and Snap’s Dual-Pivot Tuning
Researchers from UCLA and Snap have presented dual-pivot tuning, a new AI approach that leverages personal photos to customize image restoration models, better preserving individual facial features. This represents a significant advancement over existing generic techniques.
JPMorgan’s research team has introduced an AI model called DocLLM, designed to understand complex business documents. Impressively, DocLLM outperformed leading models, including GPT-4, by over 15% on some form analysis challenges.
OpenAI’s GPT Store
OpenAI has announced the upcoming launch of the GPT store, a new distribution platform for builders. The specifics of the revenue-sharing split are yet to be clarified.
Microsoft’s Copilot Apps and Keyboard Key
Microsoft has introduced new Android, iOS, and iPadOS apps for Copilot, along with a new Copilot keyboard key coming to new devices. GPT-4, DALL-E 3, Voice Chat, and Vision are now all available for free without the need for ChatGPT+.
Addressing Biases in LLMs
New research has revealed that LLMs perform better when prompted to act as gender-neutral or male rather than female. This underscores the importance of addressing biases that can creep into machine learning models’ training data.
In conclusion, 2024 is set to be a landmark year for AI and robotics, with major advancements expected across various sectors. These developments promise to revolutionize how we interact with technology, making our lives easier and more efficient.