Humanity Redefined - Google's Agentic Era - Sync #497
I hope you enjoy this free post. If you do, please like ❤️ or share it, for example by forwarding this email to a friend or colleague. Writing this post took around eight hours to write. Liking or sharing it takes less than eight seconds and makes a huge difference. Thank you! Google's Agentic Era - Sync #497Plus: Sora is out; OpenAI vs Musk drama continues; GM closes Cruise; Amazon opens AGI lab; Devin is out; a humanoid robot with artificial muscles; NASA's new Martian helicopter; and more!Hello and welcome to Sync #497! It has been an interesting week in AI, with OpenAI finally releasing Sora and Google revealing Gemini 2.0 and its vision for the agentic era, which we will examine more closely in this week’s issue of Sync. Elsewhere in AI, the feud between OpenAI and Elon Musk continues. This time, OpenAI responded with its side of the story, revealing that Musk supported OpenAI becoming a for-profit company. In other news, Devin is out, Amazon has opened an AGI lab, and every other top AI company received poor grades on safety. In robotics, GM suddenly pulled the plug on Cruise and will absorb the robotaxi company. Meanwhile, NASA plans a bigger Martian helicopter as a follow-up to Ingenuity, the first flying machine on another planet. We will also see a drone that leaps into the air like a bird and a humanoid robot powered by pneumatic artificial muscles. Additionally, we will meet a neurotech company that implants brain-computer interfaces without brain surgery and learn why the idea of designer babies and selecting “genes for X” is not likely to happen anytime soon. Enjoy! Google's Agentic EraA year ago, Google unveiled Gemini, its family of natively multimodal large language models in three sizes: Nano, Pro, and Ultra. A couple of months later, in February 2024, Google released Gemini 1.5, which introduced improvements in performance and a larger context window, enabling these models to process more input data than before. In May 2024, the Gemini family expanded with Gemini 1.5 Flash—a lightweight model designed for greater speed and efficiency. Last week, Google launched the next generation of Gemini models—Gemini 2.0—and outlined its vision for the agentic era. Google introduces Gemini 2.0 FlashThe first model revealed from the Gemini 2.0 family is Gemini 2.0 Flash. According to benchmarks provided by Google, the new Gemini 2.0 Flash, a lightweight model in the Gemini 2.0 family, already outperforms Gemini 1.5 Pro and generates responses twice as quickly. I remind you to take any first-party benchmarks with a grain of salt, as the companies releasing them aim to present their latest models in the best possible light. In the case of Gemini 2.0 Flash, the numbers look promising, but where are the comparisons to other models, such as GPT-4o, GPT-4o Mini, Anthropic’s Claude 3.5 Haiku and Sonnet, or Meta’s Llama 3? In addition to supporting multimodal inputs such as images, video, and audio, Gemini 2.0 Flash now supports multimodal output, including natively generated images combined with text and steerable text-to-speech multilingual audio. It can also natively call tools like Google Search, execute code, and use third-party user-defined functions. Gemini 2.0 Flash is available as an experimental model to developers via the Gemini API in Google AI Studio and Vertex AI. Multimodal input and text output are accessible to all developers, while text-to-speech and native image generation are available to early-access partners. General availability is scheduled for January, along with additional model sizes. Gemini 2.0 Flash is also available in the Gemini app, Google’s AI assistant and equivalent to OpenAI’s ChatGPT. Google enters the agentic eraWith the release of Gemini 2.0, Google has clearly outlined its plan for AI, which includes enhancing all Google apps and services with AI and advancing the development of AI agents. The first part of Google’s AI plan is already underway. Various Google apps and services, from Search to Gmail and even Maps, are receiving generative AI updates and new features, all powered by Gemini models. The second part of Google’s AI plan—AI agents—is still in development but Google has shown a glimpse of how it envisions the agentic era to look. The first version of Google’s agentic AI agents will most likely look like the recently revealed Project Mariner. Project Mariner is a Google Chrome extension featuring an AI chatbot that can process complex requests, break them into smaller tasks, and complete them by taking control of the user’s browser. Project Mariner is currently available only to trusted testers, with no information yet on when it will be publicly available. One interesting detail I noticed during the Project Mariner demo is the glowing blue edges, which indicate that the AI has taken control of the screen. It seems this is how tech companies plan to signal when AI is in control. Apple demonstrated something similar earlier this year with glowing edges appearing during interactions with Siri. Another example of the first wave of Google’s AI agents is Deep Research. This mode in the Gemini app handles the heavy lifting of doing research, analysing results, and preparing a report on your behalf. Additionally, there is Jules, an AI coding assistant similar to GitHub Copilot or Devin, which allows you to describe a new feature or bug, and the assistant writes the code for you. However, Google does not intend to stop at chatbots. Its vision is to develop highly capable AI assistants that can comprehend complex scenarios, devise multi-step plans, and execute them on behalf of the user. If Project Mariner is a showcase of what we can have now, then Project Astra is painting a picture of what is possible. First introduced at the Google I/O 2024 event, Project Astra is what Google describes as a universal AI assistant, designed to serve as a personal assistant for everyday life. The demos shown so far envision an app that integrates multiple sources of information—photos, videos, audio, and geolocation—to provide quick and accurate responses to user questions. With the release of Gemini 2.0, Project Astra can now converse in multiple languages and mixed languages, demonstrating an improved understanding of accents and uncommon words. It also has now up to 10 minutes of in-session memory and can recall previous conversations. Additionally, Project Astra now has the ability to utilise other apps like Google Search, Maps, or Lens, while delivering faster responses. Google has not announced a release date for Project Astra. The company has only stated that it is expanding the trusted tester program to include more participants, including a small group that will soon begin testing Project Astra on prototype glasses. Google’s vision for its agentic era is interesting. However, what we have so far are mostly polished demos and promises of possibly releasing these tools for widespread use at some point in the future. The true test of Google’s AI agents will be whether they prove useful in everyday life or remain as interesting tech demos in search of practical applications. If you enjoy this post, please click the ❤️ button or share it. Do you like my work? Consider becoming a paying subscriber to support it For those who prefer to make a one-off donation, you can 'buy me a coffee' via Ko-fi. Every coffee bought is a generous support towards the work put into this newsletter. Your support, in any form, is deeply appreciated and goes a long way in keeping this newsletter alive and thriving. 🦾 More than a humanHow Neuralink’s chief competitor is tapping into the brain without surgery Brain Stimulation Helps People Walk After Spinal Injury Those designer babies everyone is freaking out about – it’s not likely to happen 🧠 Artificial IntelligenceSora is here Elon Musk wanted an OpenAI for-profit Devin is now generally available Anthropic’s 3.5 Haiku model comes to Claude users Amazon opens new AI lab in San Francisco focused on long-term research bets Leading AI Companies Get Lousy Grades on Safety Future of Life Institute has released a report grading leading AI companies on their risk assessment efforts and safety procedures. Anthropic earned the highest grade—a C—highlighting the need for industry-wide improvement. “The purpose of this is not to shame anybody,” says Max Tegmark, an MIT physics professor and president of the Future of Life Institute. “It’s to provide incentives for companies to improve.” Apple reportedly developing AI server chip with Broadcom GenCast predicts weather and the risks of extreme conditions with state-of-the-art accuracy How Mark Zuckerberg has fully rebuilt Meta around Llama ▶️ Inside a MEGA AI GPU Server with the NVIDIA HGX H200 (19:35) Patrick from ServeTheHome got his hands on the Nvidia HGX H200, an AI-focused computing powerhouse featuring eight Nvidia H200 GPUs with a combined 1.128 terabytes of memory and delivering 32 petaFLOPS of AI performance. In this video, Patrick opens up the server, walks us through its components in detail, and puts it through various benchmarks, including power consumption tests. It is always fascinating to get an up-close look at the hardware powering modern AI models and see how those AI computing monsters are built. If you're enjoying the insights and perspectives shared in the Humanity Redefined newsletter, why not spread the word? 🤖 RoboticsCruise employees ‘blindsided’ by GM’s plan to end robotaxi program ANYbotics raises $60M to expand inspection robot deployments worldwide ▶️ NASA’s Mars Chopper Concept (0:37) Following the massive success of Ingenuity, the first flying machine operating on another planet, NASA is planning to raise the bar by designing, building and sending to Mars a much bigger helicopter. Named Mars Chopper, the project is in the early conceptual and design stages. Beyond scouting, such a helicopter could carry scientific instruments to study terrains that rovers are unable to reach. AI Company That Made Robots For Children Went Bust And Now The Robots Are Dying ▶️ Watch this bird-inspired robotic drone leap into the air (2:01) Meet RAVEN (short for Robotic Avian-inspired Vehicle for multiple ENvironments), a hybrid between a drone and a robotic bird. Using its bird-like legs, RAVEN can walk and hop like a bird. And like a bird, it can also use its legs to leap into the air and then fly like a drone aeroplane. The team behind RAVEN believes the design principles used to build RAVEN could inspire the creation of even more versatile robots. ▶️ Torso 2 by Clone with Actuated Abdomen (1:02) The humanoid robot developed by Clone Robotics is one of the most unique designs I’ve ever seen. Instead of electric motors, Clone Robotics uses artificial muscle fibres to move its body, and this video demonstrates how the robot moves using these artificial muscles. For this demonstration, the team is using pneumatic actuators with off-the-shelf valves. The final product will use hydraulic actuators and be much quieter. 💡TangentsIntel on the Brink of Death Thanks for reading. If you enjoyed this post, please click the ❤️ button or share it. Humanity Redefined sheds light on the bleeding edge of technology and how advancements in AI, robotics, and biotech can usher in abundance, expand humanity's horizons, and redefine what it means to be human. A big thank you to my paid subscribers, to my Patrons: whmr, Florian, dux, Eric, Preppikoma and Andrew, and to everyone who supports my work on Ko-Fi. Thank you for the support! My DMs are open to all subscribers. Feel free to drop me a message, share feedback, or just say "hi!" |
Older messages
OpenAI o1 goes Pro - Sync #496
Tuesday, December 10, 2024
Plus: DeepMind Genie 2; Google released Veo and Imagen 3 on Vertex AI; Tesla Optimus shows off new hand; Grok is free for all X users; ads might be coming to ChatGPT; Waymo comes to Miami; and more! ͏
Artists against AI - Sync #495
Saturday, November 30, 2024
Plus: Amazon's AI chips; OpenAI web browser with ChatGPT; new humanoid robot video; Human Cell Atlas releases its first draft; can humans hibernate?; AI agents behaving very human-like; and more! ͏
OpenAI's turbulent early years - Sync #494
Sunday, November 24, 2024
Plus: Anthropic and xAI raise billions of dollars; can a fluffy robot replace a living pet; Chinese reasoning model DeepSeek R1; robot-dog runs full marathon; a $12000 surgery to change eye colour ͏ ͏
Cracks in the Scaling Laws - Sync #493
Sunday, November 17, 2024
Plus: OpenAI's new AI agent; AlphaFold3 is open-source... kind of; Amazon releases its new AI chip; Waymo One is available for everyone in LA; how can humanity become a Kardashev Type 1
Robotics is the new AI - Sync #492
Sunday, November 10, 2024
Plus: OpenAI's and Google's new models leaked; the state of robotic investments in 2024; pest control with CRISPR; drone with a flamethrower; 81-year-old biohacker trying to reverse ageing; and
You Might Also Like
Next Holiday Season, Ignore Everyone Except One Customer
Saturday, December 21, 2024
Top Tech Content sent at Noon! Boost Your Article on HackerNoon for $159.99! Read this email in your browser How are you, @newsletterest1? 🪐 What's happening in tech today, December 21, 2024? The
🐍 New Python tutorials on Real Python
Saturday, December 21, 2024
Hey there, There's always something going on over at Real Python as far as Python tutorials go. Here's what you may have missed this past week: 🎓 Master Python's Core Principles (New Live
Post from Syncfusion Blogs on 12/21/2024
Saturday, December 21, 2024
New blogs from Syncfusion ASP.NET MVC Suite Update: Aligning with .NET Changes By Rajendran R Discover key updates in our ASP.NET MVC suite, aligning with Microsoft's latest .NET changes for
LockBit Ransomware Developer Charged for Billions in Global Damages
Saturday, December 21, 2024
THN Daily Updates Newsletter cover The Data Science Handbook, 2nd Edition ($60.00 Value) FREE for a Limited Time Practical, accessible guide to becoming a data scientist, updated to include the latest
Re: My VPN recommendation
Saturday, December 21, 2024
Do you know when to use a VPN and what it does to protect your data? Any time you are connected to the internet, your information is at risk of being tracked or hacked. A VPN helps keep your surfing
📧 Scheduling Background Jobs With Quartz in .NET (advanced concepts)
Saturday, December 21, 2024
Scheduling Background Jobs With Quartz in .NET (advanced concepts) Read on: my website / Read time: 6 minutes The .NET Weekly is brought to you by: It's been a big year for API collaborations!
The Thrill Was Never There 🎸
Saturday, December 21, 2024
Takeaways from a punk-rock creator who says he doesn't like punk. Here's a version for your browser. Hunting for the end of the long tail • December 20, 2024 The Thrill Was Never There A famous
🎮 Smartphones Will Never Kill Dedicated Handhelds — 11 Stocking Stuffers for iPhone Owners
Friday, December 20, 2024
Also: How to Add a Smart Speaker to Your Home Assistant Setup How-To Geek Logo December 20, 2024 Did You Know The nursery rhyme "Mary Had a Little Lamb" is based on a true story. The girl in
Daily Coding Problem: Problem #1643 [Easy]
Friday, December 20, 2024
Daily Coding Problem Good morning! Here's your coding interview problem for today. This problem was asked by Facebook. Given a 32-bit integer, return the number with its bits reversed. For example,
JSK Daily for Dec 20, 2024
Friday, December 20, 2024
JSK Daily for Dec 20, 2024 View this email in your browser A community curated daily e-mail of JavaScript news Empower Your Data Insights: Integrating JavaScript Gantt Chart into Power BI Syncfusion