Data Elixir - Data Elixir - Issue 447
ISSUE 447 · August 8, 2023Posts & TutorialsDo ML Models Memorize or Generalize?In 2021, researchers discovered "grokking," where tiny models suddenly shift from memorizing to generalizing unseen inputs. This interactive article explores this phenomenon and the emerging field of mechanistic interpretability, seeking insights into whether large language models generalize or merely memorize. LLMs, explained with a minimum of math and jargonIf you're new to large language models or looking for a good explainer to share with colleagues, here's an easy-to-follow, gentle primer. Sponsored LinkGenerative AI Skills ChallengeGreat ideas wanted! 💡 data.org is looking for innovative proposals on training and upskilling in generative AI to drive social impact. The Generative AI Skills Challenge will award funding and technical assistance to awardees -- click here to learn more and apply by August 15, 2023 (7:00 PM ET). Functions are VectorsConceptualizing functions as infinite-dimensional vectors lets you apply the tools of linear algebra to a vast landscape of new problems, from image and geometry processing to curve fitting, light transport, and machine learning. Great post! Jazz up your ggplots!Useful tricks for customizing ggplot design, with complete code examples to try on your own. Covers plot animation with gganimate, chart composition with cowplot, shapes with ggimage, annotations with geomtextpath, highlighting elements with gghighlight, special effects with ggfx, custom themes, and more. Log transforms, geometric means and estimating population totalsWhile log transformation can create robust models with lower heteroskedasticity and better compliance with standard assumptions, it could potentially distort population estimates. This post uses a practical example to show the possible consequences of log transformations, including diagnostic plots and estimations. Tools & CodeFinance ToolkitThe FinanceToolkit is an open-source toolkit for stock market analysis. It offers a comprehensive set of financial ratios, inidicators and performance ratios and all calculations are simple, clearly presented, and can be customized. This is an awesome resource for anyone interested in either learning about or working with finance data. Generative AI in JupyterJupyter AI brings generative AI to Jupyter notebooks, giving users the power to explain and generate code, fix errors, summarize content, ask questions about their local files, and generate entire notebooks from a natural language prompt. Resources🕶️ Awesome QuartoAwesome selection of Quarto docs, tutorials, talks, posts, tools and examples from around the web. ML⇄DB Seminar SeriesDatabases and machine learning are inextricably linked. Databases provide the storage for the vast volumes of data that's required by ML algorithms and, in turn, the ML algorithms infuse the databases with new capabilities. In this free seminar series, speakers from industry explore this growing convergence. Was this email forwarded to you? Sign up here >> |
Older messages
Data Elixir - Issue 446
Tuesday, August 1, 2023
Nix for data science. Telling Stories with Data. Design patterns for LLM systems & products. Practical guide to conjoint analysis. Treemaps.
Data Elixir - Issue 445
Tuesday, July 25, 2023
Salary Calculator. Python vector DBs. Visual superpowers. Polars for R Cookbook. Test Driven Data Analysis. Python cheatsheet.
Data Elixir - Issue 444
Tuesday, July 18, 2023
Advanced Python. Financial ML. GPT-4 superpowers. VScode + Docker + Python = ❤️. Dimensionality reduction. Regulating AI.
Data Elixir - Issue 443
Tuesday, July 11, 2023
Unraveling PCA. Hidden tools in Python. SQL inner joins. Statistical learning for python. Demystifying text data. How to do great work.
Data Elixir - Issue 442
Tuesday, June 27, 2023
Polars cookbook. LLM-powered autonomous agents. Time series with ML. Scalable & extensible viz. ML system design.
You Might Also Like
Import AI 399: 1,000 samples to make a reasoning model; DeepSeek proliferation; Apple's self-driving car simulator
Friday, February 14, 2025
What came before the golem? ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
Defining Your Paranoia Level: Navigating Change Without the Overkill
Friday, February 14, 2025
We've all been there: trying to learn something new, only to find our old habits holding us back. We discussed today how our gut feelings about solving problems can sometimes be our own worst enemy
5 ways AI can help with taxes 🪄
Friday, February 14, 2025
Remotely control an iPhone; 💸 50+ early Presidents' Day deals -- ZDNET ZDNET Tech Today - US February 10, 2025 5 ways AI can help you with your taxes (and what not to use it for) 5 ways AI can help
Recurring Automations + Secret Updates
Friday, February 14, 2025
Smarter automations, better templates, and hidden updates to explore 👀 ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
The First Provable AI-Proof Game: Introducing Butterfly Wings 4
Friday, February 14, 2025
Top Tech Content sent at Noon! Boost Your Article on HackerNoon for $159.99! Read this email in your browser How are you, @newsletterest1? undefined The Market Today #01 Instagram (Meta) 714.52 -0.32%
GCP Newsletter #437
Friday, February 14, 2025
Welcome to issue #437 February 10th, 2025 News BigQuery Cloud Marketplace Official Blog Partners BigQuery datasets now available on Google Cloud Marketplace - Google Cloud Marketplace now offers
Charted | The 1%'s Share of U.S. Wealth Over Time (1989-2024) 💰
Friday, February 14, 2025
Discover how the share of US wealth held by the top 1% has evolved from 1989 to 2024 in this infographic. View Online | Subscribe | Download Our App Download our app to see thousands of new charts from
The Great Social Media Diaspora & Tapestry is here
Friday, February 14, 2025
Apple introduces new app called 'Apple Invites', The Iconfactory launches Tapestry, beyond the traditional portfolio, and more in this week's issue of Creativerly. Creativerly The Great
Daily Coding Problem: Problem #1689 [Medium]
Friday, February 14, 2025
Daily Coding Problem Good morning! Here's your coding interview problem for today. This problem was asked by Google. Given a linked list, sort it in O(n log n) time and constant space. For example,
📧 Stop Conflating CQRS and MediatR
Friday, February 14, 2025
Stop Conflating CQRS and MediatR Read on: my website / Read time: 4 minutes The .NET Weekly is brought to you by: Step right up to the Generative AI Use Cases Repository! See how MongoDB powers your