Data Elixir - Data Elixir - Issue 447
ISSUE 447 · August 8, 2023Posts & TutorialsDo ML Models Memorize or Generalize?In 2021, researchers discovered "grokking," where tiny models suddenly shift from memorizing to generalizing unseen inputs. This interactive article explores this phenomenon and the emerging field of mechanistic interpretability, seeking insights into whether large language models generalize or merely memorize. LLMs, explained with a minimum of math and jargonIf you're new to large language models or looking for a good explainer to share with colleagues, here's an easy-to-follow, gentle primer. Sponsored LinkGenerative AI Skills ChallengeGreat ideas wanted! 💡 data.org is looking for innovative proposals on training and upskilling in generative AI to drive social impact. The Generative AI Skills Challenge will award funding and technical assistance to awardees -- click here to learn more and apply by August 15, 2023 (7:00 PM ET). Functions are VectorsConceptualizing functions as infinite-dimensional vectors lets you apply the tools of linear algebra to a vast landscape of new problems, from image and geometry processing to curve fitting, light transport, and machine learning. Great post! Jazz up your ggplots!Useful tricks for customizing ggplot design, with complete code examples to try on your own. Covers plot animation with gganimate, chart composition with cowplot, shapes with ggimage, annotations with geomtextpath, highlighting elements with gghighlight, special effects with ggfx, custom themes, and more. Log transforms, geometric means and estimating population totalsWhile log transformation can create robust models with lower heteroskedasticity and better compliance with standard assumptions, it could potentially distort population estimates. This post uses a practical example to show the possible consequences of log transformations, including diagnostic plots and estimations. Tools & CodeFinance ToolkitThe FinanceToolkit is an open-source toolkit for stock market analysis. It offers a comprehensive set of financial ratios, inidicators and performance ratios and all calculations are simple, clearly presented, and can be customized. This is an awesome resource for anyone interested in either learning about or working with finance data. Generative AI in JupyterJupyter AI brings generative AI to Jupyter notebooks, giving users the power to explain and generate code, fix errors, summarize content, ask questions about their local files, and generate entire notebooks from a natural language prompt. Resources🕶️ Awesome QuartoAwesome selection of Quarto docs, tutorials, talks, posts, tools and examples from around the web. ML⇄DB Seminar SeriesDatabases and machine learning are inextricably linked. Databases provide the storage for the vast volumes of data that's required by ML algorithms and, in turn, the ML algorithms infuse the databases with new capabilities. In this free seminar series, speakers from industry explore this growing convergence. Was this email forwarded to you? Sign up here >> |
Older messages
Data Elixir - Issue 446
Tuesday, August 1, 2023
Nix for data science. Telling Stories with Data. Design patterns for LLM systems & products. Practical guide to conjoint analysis. Treemaps.
Data Elixir - Issue 445
Tuesday, July 25, 2023
Salary Calculator. Python vector DBs. Visual superpowers. Polars for R Cookbook. Test Driven Data Analysis. Python cheatsheet.
Data Elixir - Issue 444
Tuesday, July 18, 2023
Advanced Python. Financial ML. GPT-4 superpowers. VScode + Docker + Python = ❤️. Dimensionality reduction. Regulating AI.
Data Elixir - Issue 443
Tuesday, July 11, 2023
Unraveling PCA. Hidden tools in Python. SQL inner joins. Statistical learning for python. Demystifying text data. How to do great work.
Data Elixir - Issue 442
Tuesday, June 27, 2023
Polars cookbook. LLM-powered autonomous agents. Time series with ML. Scalable & extensible viz. ML system design.
You Might Also Like
⚠️ Avoiding AI Scams on Social Media — An Open Source Google Photos Alternative
Sunday, May 5, 2024
Also: Reviewing the Customizable Drop Mechanical Keyboard, and More! How-To Geek Logo May 5, 2024 📩 Get expert reviews, the hottest deals, how-to's, breaking news, and more delivered directly to
Daily Coding Problem: Problem #1432 [Medium]
Sunday, May 5, 2024
Daily Coding Problem Good morning! Here's your coding interview problem for today. This question was asked by Snapchat. Given the head to a singly linked list, where each node also has a “random”
PD#572 Good Ideas in Computer Science
Sunday, May 5, 2024
Ideas every programmer likes and why Garbage Collection and Object Oriented Programming don't count
RD#454 API Layer & Fetch Functions
Sunday, May 5, 2024
ixing API and UI code quickly leads to messy and unmaintainable code
The Shiny Toy Syndrome & Tiny macOS utility apps I love
Sunday, May 5, 2024
Lex launching its redesign, Raycast shares another monthly update packed with AI updates, prompts should be designed not engineered, and a lot more in this week's issue of Creativerly. Creativerly
Hyundai antes up $1B for AV startup Motional and Elon unplugs the Tesla Supercharger team
Sunday, May 5, 2024
Plus, layoffs come for Luminar, Fisker and Ola View this email online in your browser By Kirsten Korosec Sunday, May 5, 2024 Image Credits: Motional Welcome back to TechCrunch Mobility — your central
C#504 Adventures serializing absolutely everything in C#
Sunday, May 5, 2024
A fantastic journey porting Newtonsoft.Json to System.Text.Json
Sunday Digest | Featuring 'Which City Has the Most Billionaires in 2024?' 📊
Sunday, May 5, 2024
Every visualization published this week, in one place. Visual Capitalist Sunday Digest logo May 5, 2024 | View Online | Subscribe | VC+ The Best of This Week's Visuals Presented by Voronoi: The
The dark side of startup accelerators
Sunday, May 5, 2024
Plus: No easy solution to AI hallucinations View this email online in your browser By Anthony Ha Sunday, May 5, 2024 Image Credits: Bryce Durbin This Week, TechCrunch dug into the struggles at two
Android Weekly #621
Sunday, May 5, 2024
View in web browser 621 May 5th, 2024 Articles & Tutorials Sponsored Genius Scan SDK: a document scanner in your app Embed a reliable document scanner with OCR in your app, enabling your customers