Llama 3 is out - Weekly News Roundup - Issue #463
Llama 3 is out - Weekly News Roundup - Issue #463Plus: brand-new, all-electric Atlas; AI Index Report 2024; Microsoft pitched GenAI tools to US military; Humane AI Pin reviews are in; debunking Devin; and more!Hello and welcome to Weekly News Roundup Issue #463. This was another eventful week in tech with a lot going on. Meta released their latest open large language model, Llama 3, which will be the main focus of this issue. Apart from that, Google announced new Gemini-powered products at Google Cloud Next 2024. The Humane AI Pin is out, and the reviews are not good. Boston Dynamics surprised everyone by retiring the hydraulic-powered Atlas and revealing the new, all-electric Atlas. These news and more will be covered in this week’s news roundup. I hope you enjoy it! Meta has released Llama 3, a family of open-weights large language models and the successor to Llama 2 model. Llama 3 comes in three sizes: the 8B and 70B parameter models are available now, while the largest model in the Llama 3 family, the 400B+ model, is still being trained and will be released later. In terms of performance, the Llama 3 models do not disappoint—at least according to the benchmark results published by Meta. These benchmarks show Llama 3 to be one of the best, if not the best, open large language models. Llama 3 8B surpasses Google's Gemma 7B and Mistral 7B models, while Llama 3 70B is at the same level, if not slightly better than, Google's Gemini Pro 1.5 and Anthropic's Claude 3 Sonnet. LMSYS Chatbot Arena, which compares large language models based on human preferences, lists Llama 3 8B in the same neighbourhood as Mixtral 7x22B, Mistral Medium and Command R models, all of which are much bigger than Llama 3 8B. Meanwhile, Llama 3 70B is ranked 7th, alongside models such as GPT-4, Claude 3 Sonnet, and Command R+. In the post announcing the new models, Meta stated that the Llama 3 models were pretrained on over 15 trillion tokens collected from publicly available sources. The training dataset is seven times larger than that used for Llama 2 and includes four times more code, so the quality of the answers should be higher. The overall picture that emerges from these benchmarks is that Llama 3 8B is the best model in its category of small models, while Llama 3 70B is the best open language model in the GPT-3.5 category and overall the best open-weights model. And there is still the 400B+ model to be released, which might be aiming for the top of the leaderboards. Meta gave us a sneak peek into the performance level of the still-in-training Llama 3 400B+ model, and the numbers they report are roughly at the same level as the best models from OpenAI, Google, and Anthropic. However, let’s keep in mind that these are numbers published by Meta. Many developers are now downloading and testing the new Llama 3 models to see what they are actually capable of and how they compare to other open and proprietary models. We should have a better picture once the results from those experiments are published. Meta promises to release updated versions of Llama 3 models in the near future. Currently, Llama 3 models are text-only models with 8,192 token context window. Meta plans to add multimodality, the ability to converse in multiple languages (Llama 3 currently only supports English), a much longer context window, and stronger overall capabilities, including improved reasoning and executing multi-step plans. Llama 3 8B and 70B models are available on HuggingFace and will arrive soon to AWS, Databricks, Google Cloud, Microsoft Azure, NVIDIA NIM, and Snowflake. Llama 3 can also be downloaded directly from Meta but that method requires signing in. The easiest way to get Llama 3 and start experimenting with it is to use tools such as Ollama if you want to run it on your computer. But if you don’t want to download and set up Llama 3 on your computer, you can try it out in Meta AI, Meta’s new AI assistant (assuming you are in one of the supported countries). Llama 3 will also be used internally by Meta to power new AI features coming soon to Facebook, Messenger, WhatsApp and Instagram, exposing billions of people to AI chatbots and generative AI tools. In conclusion, Meta, the unlikely champion for open-weight models, delivers two models offering top performance in their respective categories. That is a good news for AI and open source communities which now have access to two more very capable models to experiment with and build new AI-powered applications. But another thing the release of Llama 3 has shown is that the gaps between top models have become smaller in 2024. We are no longer in a situation where GPT-4 enjoys a massive advantage. The competitors have caught up and free models with open weights offering similar levels of performance are emerging. Now let’s see what OpenAI is going to bring to the table. If you enjoy this post, please click the ❤️ button or share it. Do you like my work? Consider becoming a paying subscriber to support it For those who prefer to make a one-off donation, you can 'buy me a coffee' via Ko-fi. Every coffee bought is a generous support towards the work put into this newsletter. Your support, in any form, is deeply appreciated and goes a long way in keeping this newsletter alive and thriving. 🦾 More than a humanPea-sized brain implant could treat depression and more What Neuralink Is Missing Rejuvenating the Blood Cell Population 🧠 Artificial IntelligenceMicrosoft Pitched OpenAI’s DALL-E as Battlefield Tool for U.S. Military Humane AI Pin reviews are in… and they are not positive The Humane AI Pin, one of the first devices in the new class built around AI, is now available, along with its reviews—and the feedback is not good. MKBHD called Humane AI Pin the worst product he ever reviewed. The Verge said the device, which costs $700 plus an additional $25 per month subscription, just does not work, while Wired concludes that Humane AI Pin is “too bare-bones and not all that useful”. The Rabbit R1 is still set to be released soon, so it will be interesting to see what it brings to the table, but I'm not keeping my hopes high. Google announced a suite of new AI tools and services at the Google Cloud Next 2024 event, many of which are powered by Gemini 1.5 Pro, now in public preview. Also, Imagen 2.0 is now generally available in Vertex AI. Gemini for Google Workspace introduces a bunch of tools aiming to increase productivity. One of those new introduced tools is Google Vids, an AI video creation tool, designed to make video production much easier. For engineers and developers, Gemini will power new coding assistance tools as well as tools for creating cloud infrastructure and improving cybersecurity. Google also announced new hardware optimized for AI for Google Cloud, including machines equipped with Nvidia’s Blackwell GPUs, coming in early 2025. What piqued my interest is Vertex AI Agent Builder, a tool for creating AI agents for various tasks, from interacting with customers to automating jobs. The full, one-hour-long opening keynote is available on YouTube. Google also published a good summary on Google Cloud blog. Artificial Intelligence Index Report 2024 Microsoft and G42 partner to accelerate AI innovation in UAE and beyond DeepMind CEO Says Google Will Spend More Than $100 Billion on AI Texas is replacing thousands of human exam graders with AI ▶️ Debunking Devin: "First AI Software Engineer" Upwork lie exposed! (25:15) A month ago, Cognition Labs released Devin, the 'First AI Software Engineer.' The company claimed that Devin has successfully passed technical tests from leading AI companies and has completed real jobs on Upwork. This video focuses on the latter claim, exposing in detail how the video showing Devin performing the task was manipulated to make the AI appear better than it actually is in reality. If you're enjoying the insights and perspectives shared in the Humanity Redefined newsletter, why not spread the word? 🤖 RoboticsThis week, Boston Dynamics bid farewell to hydraulic-powered Atlas and celebrated with 11 years of falls, jumps, dances and everything else that Atlas achieved. A day later, Boston Dynamics unveiled the brand-new, all-electric Atlas. The way new Atlas introduced itself to the world might came across as creepy but what Boston Dynamics showed is way ahead of everyone else. Boston Dynamics’ Robert Playter on the New Atlas ▶️ LASSIE - a robot-dog for the Moon (5:15) Researchers from NASA and a couple of US universities experiment with the idea of using robot dogs to explore the surface of the Moon. While no robot dog is scheduled to go to the Moon in the near future, researchers are currently testing how these robots handle difficult terrain on Earth in challenging environments that, in one way or another, mimic what such a robot might encounter on the Moon's surface. ▶️ Making a Dog-Sized Furby Robot (and taking it on a walk) (23:01) We live in a future where anybody can buy a robot dog and turn it into a Furby monstrosity. I love it. Tiny AI-trained robots demonstrate remarkable soccer skills Those tiny robots, standing just 5 cm tall, have learned to play football (or soccer) all on their own. Using deep reinforcement learning, researchers from DeepMind made the robots to figure out how to stand up, run and kick the ball in a simulated environment. After countless trials and errors, the robots learned to play football quite well in one-versus-one games. 🧬 BiotechnologyThe Wizardry and Prophecy of DNA Printing 💡TangentsWhen Bacteria Are Beautiful Thanks for reading. If you enjoyed this post, please click the ❤️ button or share it. Humanity Redefined sheds light on the bleeding edge of technology and how advancements in AI, robotics, and biotech can usher in abundance, expand humanity's horizons, and redefine what it means to be human. A big thank you to my paid subscribers, to my Patrons: whmr, Florian, dux, Eric, Preppikoma and Andrew, and to everyone who supports my work on Ko-Fi. Thank you for the support! My DMs are open to all subscribers. Feel free to drop me a message, share feedback, or just say "hi!" |
Older messages
The AI race is not slowing down - Weekly News Roundup - Issue #462
Saturday, April 13, 2024
Plus: Udio, a new AI music generator; 200+ artists sign an open letter against AI; Intel Gaudi 3 AI accelerator; Tesla to reveal robotaxis on August 8th; more humanoid robots show off their skills ͏ ͏
Microsoft is building a $100B supercomputer for AGI - Weekly News Roundup - Issue #461
Friday, April 5, 2024
Plus: Israel uses AI to target Hamas; US announces partnerships on AI safety with the UK and EU; Apple explores personal robotic projects; how biomanufacturing can move beyond steel vats; and more! ͏ ͏
The Century of Biology is upon us
Wednesday, April 3, 2024
How synthetic biology and emerging bioeconomy promises to make industrial processes cleaner, simpler and more sustainable ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
DBRX - the new king of open source LLMs - Weekly News Roundup - Issue #460
Friday, March 29, 2024
Plus: are Rabbit R1 and Humane AI Pin scams?; Anthropic gets $2.75 billion from Amazon; Grok-1.5; robotic police dog shot multiple times; drugs made in space; and more! ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
Nvidia aims to become the world's first AI foundry - Weekly News Roundup - Issue #459
Saturday, March 23, 2024
Plus: Microsoft + Inflection AI; Apple MM1; Grok is open source; a human with Neuralink implant plays chess and Civilisation; employees at top AI labs fear safety is an afterthought; and more! ͏ ͏ ͏ ͏
You Might Also Like
Unreleased Microsoft keyboard rolls out
Thursday, January 2, 2025
🪙 My wild Bitcoin ride; CES 2025; Security cam tips -- ZDNET ZDNET Tech Today - US January 2, 2025 Incase Compact Ergonomic Keyboard Exclusive: Incase announces new keyboard that Microsoft designed (
wpmail.me issue#700
Thursday, January 2, 2025
wpMail.me wpmail.me issue#700 - The weekly WordPress newsletter. No spam, no nonsense. - January 2, 2025 Is this email not displaying correctly? View it in your browser. News & Articles 10 Proven
Post from Syncfusion Blogs on 01/02/2025
Thursday, January 2, 2025
New blogs from Syncfusion Transform JSON into Stunning Charts: Auto-Generate Visuals with Syncfusion ® .NET MAUI Toolkit By Saravanan Madheswaran This blog explains how to auto-generate stunning charts
⚙️ Microsoft and OpenAI's AGI
Thursday, January 2, 2025
Plus: xAI's $12 billion
Are You Prepared for 2025’s Most Impactful Challenges?
Thursday, January 2, 2025
Stay ahead with insights and strategies for this year's biggest cybersecurity challenges. The Hacker News Cyber threats evolve fast, and 2025's threats will be no exception. Familiar challenges
Malicious Obfuscated NPM Package Disguised as an Ethereum Tool Deploys Quasar RAT
Thursday, January 2, 2025
THN Daily Updates Newsletter cover Full Stack Web Development ($54.99 Value) FREE for a Limited Time This book offers a comprehensive guide to full stack web development, covering everything from core
Re: This took me 10 minutes and protects my privacy
Thursday, January 2, 2025
My New Year's resolution is to do a better job of protecting my identity online. Last year, billions of personal records were compromised due to data breaches. That's why I wanted to tell you
Edge 462: What is Fast-LLM. The New Popular Framework for Pretraining your Own LLMs
Thursday, January 2, 2025
Created by ServiceNow, the framework provides the key building blocks for pretraining AI models. ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
The Evolution of Android Architecture Patterns
Thursday, January 2, 2025
View in browser 🔖 Articles The Evolution of Android Architecture Patterns As we welcome the New Year, dive into the transformative journey of Android architecture patterns. From MVC to MVI, explore how
🤖 Here’s How Robots are Farming Your Food — My 5 Video Game Resolutions for 2025
Wednesday, January 1, 2025
Also: Facebook Is Too Good at Suggesting Ads, and More! How-To Geek Logo January 1, 2025 Did You Know After the 1982 film ET: The Extra-Terrestrial featured Reese's Pieces prominently as a treat