TheSequence - Sakana AI
Was this email forwarded to you? Sign up here Next Week in The Sequence:
You can subscribe to The Sequence below:📝 Editorial: Sakana AIA few weeks ago, we discussed an interesting AI agent called "The AI Scientist," which was able to conduct complex experiments over the long term. The AI Scientist was created by Sakana AI, one of the most innovative AI labs in the world, which just announced a $100 million Series A funding round this week from marquee investors, including NVIDIA, Khosla Ventures, and NEA. Two fundamental aspects make Sakana AI stand out. First is its target market: Sakana AI is strategically focused on Japan. The world's leading economies are beginning to realize the importance of building world-class AI labs that develop models optimized for local knowledge. Japan is emerging as a key target, presenting a strong alternative to China in Asia. Unsurprisingly, larger competitors such as OpenAI and Cohere are also expanding their operations in the country. The second distinguishing feature of Sakana AI is its architecture for foundation models. While most large AI labs are continuing to push the scaling limits of transformers to build larger and more capable models, Sakana AI is experimenting with novel architectural paradigms to develop smaller, more efficient models. The AI Scientist is built on a unique neurosymbolic architecture that combines large language models (LLMs) with more traditional methods. Since its inception, Sakana AI has been vocal about its intention to create AI models based on evolutionary dynamics and collaboration between different expert models. Some of their early models have provided a glimpse into this approach. Competing with major AI labs has become an almost impossible challenge for startups. However, Sakana AI’s focus on a specific geographic region and smaller models might give it a crucial competitive edge. For now, their innovative models are challenging some of the conventional wisdom in the broader AI landscape. 🔎 ML ResearchOLMoEAllen AI published a paper detailing OLMoE, a fully open source mixture-of-experts(MoE) model. Specifically, the expand on the famous OLMO architecture to build. OLMoE-1B-7B and . OLMoE-1B-7B-Instruct, two MoE models trained on over 5 trillion tokens —> Read more. AlphaProteoGoogle DeepMind published a paper introducing AlphaProteo, a family of ML models for protein design. AlphaProteo can generate 3 to 300 times better binding affinities to target molecules —> Read more. Agent QResearchers from Stanford University and Multion published a paper detailing Agent Q, a framework for building web agents that can plan and heal. Agent Q combines Monte Carlo Tree Search, reinforcement learning and self-critique to build agents that interact with web environments —> Read more. DPPOResearchers from Princeton University, MIT and Carnegie Mellon University published a paper discussing diffusion policy policy optimization(DPPO), a framework for fine-tuning difusion-based policies. DPPO excels in continious control and robot learning tasks using reinforcement learning policy gradient method which are a popular policy optimization method → Read more. Evaluating LLM JailbreakingBerkeley AI Research(BAIR) published a paper proposing a technique to evaluate LLM jailbreaking methods. The paper introduces StrongREJECT , a benchmark for evaluating the robustness of jailbreaking methods in LLMs —> Read more. High Troughput, Long-Context InferenceTogether AI published a paper presenting a speculative decoding technique to increase throughput in the long-context and large batch regime. The paper introduces two new algorithms called MagicDec and Adaptive Sequoia Trees respectively in order to improve inference runs over large context windows —> Read more. 🤖 AI Tech ReleasesxLAMSalesforce open sourced xLAM, a series of LLMs optimized for function calling and agentic tasks —> Read more. Claude EnterpriseAnthropic released an enterprise version of its marquee model —> Read more. Reflection 70BHyperWrite AI open sourced Reflection 70B, a Llama based model that top several benchmark leaderboards —> Read more. 📡AI Radar
You’re on the free list for TheSequence Scope and TheSequence Chat. For the full experience, become a paying subscriber to TheSequence Edge. Trusted by thousands of subscribers from the leading AI labs and universities. |
Older messages
Edge 428: Inside PrompPoet: Character.ai's Framework for Prompt Engineering
Thursday, September 5, 2024
The open source framework abstracts the core building blocks for prompt creation, optimization and management. ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
Edge 427: Jamba Combines SSMs, Transformers and MOEs in a Single Model
Tuesday, September 3, 2024
Can a hybrid design outperform each one of the baseline architectures? ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
Cerebras Inference and the Challenges of Challenging NVIDIA’s Dominance
Sunday, September 1, 2024
Why does NVIDIA remains virtually unchallenged in the AI chip market? ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
📝 Guest Post: Will Retrieval Augmented Generation (RAG) Be Killed by Long-Context LLMs?*
Friday, August 30, 2024
Pursuing innovation and supremacy in AI shows no signs of slowing down. ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
Edge 426: Reviewing Google DeepMind’s New Tools for AI Interpretability and Guardrailing
Thursday, August 29, 2024
Gemma Scope and ShieldGemma are some of the latest additions to DeepMind's Gemma stack ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
You Might Also Like
Edge 459: Quantization Plus Distillation
Tuesday, December 24, 2024
Some insights into quantized distillation ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
Prepare for a Lifetime of Adventure with Rosetta Stone
Tuesday, December 24, 2024
The Perfect Gift For Every Traveler on Your List Rosetta Stone makes it easy to connect with the world in a whole new way. With a Lifetime Unlimited plan, users can access 25 languages to prepare for
Tuesday Triage #232
Tuesday, December 24, 2024
Your weekly crème de la crème of the Internet is here! The 232nd edition featuring fish traps, little Mussolinis, and volvelles. ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
Elastic Community Newsletter
Tuesday, December 24, 2024
Check out the latest from the Elastic Community ㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤ ㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤ ㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤ elastic | Search. Observe. Protect community-newsletter-header-img.png
Daily Coding Problem: Problem #1646 [Medium]
Monday, December 23, 2024
Daily Coding Problem Good morning! Here's your coding interview problem for today. This problem was asked by Facebook. Write a function that rotates a list by k elements. For example, [1, 2, 3, 4,
GCP Newsletter #430
Monday, December 23, 2024
Welcome to issue #430 December 23rd, 2024 News Event Official Blog Calling all devs: Code the future of baseball with Google Cloud and MLB - Google Cloud and MLB are hosting a hackathon where
⏯️ Make a Holiday Guest Profile for Your Streaming Services — What Is Linux Mint?
Monday, December 23, 2024
Also: I Played the Worst Mobile Games So You Don't Have To, and More! How-To Geek Logo December 23, 2024 Did You Know The giant splashes of color that make poinsettias a popular holiday decoration
Ranked | The Most Satisfying vs. Most Reliable Car Brands in 2024 🚙
Monday, December 23, 2024
The most reliable car brands are rarely the most satisfying to own, according to recent Consumer Reports survey data. View Online | Subscribe | Download Our App Presented by: Find the megatrends
Bitcoin Enthusiasts Are Letting Altcoins Pass by
Monday, December 23, 2024
Top Tech Content sent at Noon! Boost Your Article on HackerNoon for $159.99! Read this email in your browser How are you, @newsletterest1? 🪐 What's happening in tech today, December 23, 2024? The
Last Minute Gifts from Walmart
Monday, December 23, 2024
ZDNET ZDNET Sponsored Message In Partnership with Walmart December 23, 2024 exclusive offer Walmart Last-minute gifts from Walmart Shop Now Walmart The tech you've been wishing for–at everyday low