Data Elixir - Data Elixir - Issue 445
ISSUE 445 · July 25, 2023PodcastsTest Driven Data Analysis - with Nick RadcliffeTest-Driven Data Analysis (TDDA) is an approach to quality for data and for data analysis pipelines. In this podcast interview, Nick Radcliffe introduces the philosophy and the associated Python library, TDDA, that he's been developing since 2016. Sponsored LinkComplete customer profiles in your data warehouseRudderStack Profiles takes the SQL grunt work out of building customer profiles. You specify the customer traits, then Profiles runs the joins and computations for you to create complete profiles, so you can build better models, faster. And that’s just one use case. Get all the details. Posts & TutorialsGetting started with Vector DBs in PythonEspecially with the popularity of LLMs, vector databases are all the rage these days. Which should you choose? Here's a great overview of nine popular options for Python, including strengths of each, sample code, and useful links along the way. Emphasize what you want readers to see with colorOne of the biggest superpowers of visualization is to lead a reader’s eye to the data you want to emphasize. By using color to create a visual hierarchy, you can decide what your readers see first, second, third, and last. This is a great post with lots of examples along the way. From zero to hero: end to end data applications with SQL and JupyterIn this online course, you'll learn how to develop and deploy an end-to-end data application with SQL, Python and Jupyter notebooks. Covers exploratory data analysis, SQL basics, workflow reproducibility, data pipelines, deployment, and more. Introduction to Cloud-Based Geospatial AnalysisNice introduction to Cloud-Based Geospatial Analysis using Google's Earth Engine and the geemap Python package. Covers the basics of Earth Engine data types and how to visualize, analyze, and export Earth Engine data in a Jupyter environment using geemap. CareerSalary CalculatorAwesome salary calculator, based on a compensation prediction model that was built for a Kaggle competition in 2022. This was made specifically for people who work with data and there are a lot of options, including a variety of positions, locations, experience levels, and more. ResourcesCookbook Polars for RThis cookbook for R users offers a side-by-side comparison of polars, R base, dplyr, tidyr and data.table for common tasks and problems. Comprehensive Python CheatsheetExhaustive and concise — a truly Pythonic cheat sheet for the Python programming language. OutlierLLM Training PuzzlesThis is a collection of 8 challenging puzzles about training large language models (or really any NN) on many, many GPUs. The goal of these puzzles is to get hands-on experience with the key primitives and to understand the goals of memory efficiency and compute pipelining. Was this email forwarded to you? Sign up here >> |
Older messages
Data Elixir - Issue 444
Tuesday, July 18, 2023
Advanced Python. Financial ML. GPT-4 superpowers. VScode + Docker + Python = ❤️. Dimensionality reduction. Regulating AI.
Data Elixir - Issue 443
Tuesday, July 11, 2023
Unraveling PCA. Hidden tools in Python. SQL inner joins. Statistical learning for python. Demystifying text data. How to do great work.
Data Elixir - Issue 442
Tuesday, June 27, 2023
Polars cookbook. LLM-powered autonomous agents. Time series with ML. Scalable & extensible viz. ML system design.
Data Elixir - Issue 441
Tuesday, June 20, 2023
Julia programming for ML. Spatial statistics. Raincloud plots. Artifact corrections for effect sizes. Perils of faking data in Excel. Private LLMs for DB interactions.
Data Elixir - Issue 440
Tuesday, June 13, 2023
NFL Analytics. Sequential testing. Data + Music. Managing generative AI risks. FinGPT: open-source LLM for finance. Data exploration toolkit.
You Might Also Like
Import AI 399: 1,000 samples to make a reasoning model; DeepSeek proliferation; Apple's self-driving car simulator
Friday, February 14, 2025
What came before the golem? ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
Defining Your Paranoia Level: Navigating Change Without the Overkill
Friday, February 14, 2025
We've all been there: trying to learn something new, only to find our old habits holding us back. We discussed today how our gut feelings about solving problems can sometimes be our own worst enemy
5 ways AI can help with taxes 🪄
Friday, February 14, 2025
Remotely control an iPhone; 💸 50+ early Presidents' Day deals -- ZDNET ZDNET Tech Today - US February 10, 2025 5 ways AI can help you with your taxes (and what not to use it for) 5 ways AI can help
Recurring Automations + Secret Updates
Friday, February 14, 2025
Smarter automations, better templates, and hidden updates to explore 👀 ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
The First Provable AI-Proof Game: Introducing Butterfly Wings 4
Friday, February 14, 2025
Top Tech Content sent at Noon! Boost Your Article on HackerNoon for $159.99! Read this email in your browser How are you, @newsletterest1? undefined The Market Today #01 Instagram (Meta) 714.52 -0.32%
GCP Newsletter #437
Friday, February 14, 2025
Welcome to issue #437 February 10th, 2025 News BigQuery Cloud Marketplace Official Blog Partners BigQuery datasets now available on Google Cloud Marketplace - Google Cloud Marketplace now offers
Charted | The 1%'s Share of U.S. Wealth Over Time (1989-2024) 💰
Friday, February 14, 2025
Discover how the share of US wealth held by the top 1% has evolved from 1989 to 2024 in this infographic. View Online | Subscribe | Download Our App Download our app to see thousands of new charts from
The Great Social Media Diaspora & Tapestry is here
Friday, February 14, 2025
Apple introduces new app called 'Apple Invites', The Iconfactory launches Tapestry, beyond the traditional portfolio, and more in this week's issue of Creativerly. Creativerly The Great
Daily Coding Problem: Problem #1689 [Medium]
Friday, February 14, 2025
Daily Coding Problem Good morning! Here's your coding interview problem for today. This problem was asked by Google. Given a linked list, sort it in O(n log n) time and constant space. For example,
📧 Stop Conflating CQRS and MediatR
Friday, February 14, 2025
Stop Conflating CQRS and MediatR Read on: my website / Read time: 4 minutes The .NET Weekly is brought to you by: Step right up to the Generative AI Use Cases Repository! See how MongoDB powers your