Data Elixir - Issue 434
ISSUE 434 · May 2, 2023Tutorials & OpinionsSynthetic data could be better than real dataGetting computers to manufacture realistic data could help address privacy concerns in domains like medicine and education. And if researchers can find the right balance between accuracy and fakery, synthetic data could also help create better datasets to begin with. p Values Are Useful for A/B Tests, SometimesWhen you're running A/B tests, does it make sense to use p-values? According to the American Statistical Association, the answer is "no" but don't take that as a hard rule. In this post, Harlan Harris explores the use of p-values in A/B tests and when it does and doesn't make sense to use them. Sponsored LinkUse this to do data planning with your business teamsArrive at clear definitions for business metrics with stakeholders. Record how metrics like leads, users, and customers are defined, and use it to document how other North Star metrics, like ARR, are being tracked. Get it here. Tools & CodecharlatanInspired by Python's faker package, charlatan makes it easy to create fake data using R. Use it to create dummy data for names, phone numbers, emails, DOI numbers, genes, and more! ResourcesUnderstanding Large Language ModelsGreat reading list for getting up to speed with LLMs. This list explores some of the most influential papers for understanding the design, constraints, and evolution of LLMs — starting at the beginning. For each paper there's a short summary, key diagrams, and a link to the paper. The Little Book of Deep LearningNice introduction to deep learning, starting with the basics of machine learning and efficient computation. From there, it covers a variety of topics including model components, architectures, and applications. Sharpen your math, CS and data skills in 15 minutes a dayFor professionals and lifelong learners alike, Brilliant is one of the best ways to learn. The deets: Bite-sized interactive lessons make it easy to level up in everything from math and data science to AI and beyond. Join 10+ million people building skills every day. Start your 30-day free trial today! CareerHow Academic Bullying Led This Data Scientist to Open ScienceUnless you've been there, it's easy to be idealistic about jobs in academia. In this post, Paola Chiara Masuzzo shares her 12-year journey starting as an excited PhD student to being disillusioned with academia and the path she took to rediscover her love for science. New OpportunitiesThe Data Elixir Job Board currently lists 55 openings for a variety of roles, including data scientists, data analysts, machine learning engineers, researchers, and more. The roles cover a variety of levels, from entry-level to Director and most of the jobs are remote. Data VisualizationMaking Middle Earth maps with RMiddle Earth might not be the most obvious place for a data visualization tutorial but this sure works well. Starting with a set of open-source shapefiles from J. R. R. Tolkien's Middle Earth, the tutorial walks through a variety of spatial data visualization techniques, including projections, map layers, distances, scaling and more! Awesome ggplot2 🕶️Great collection of curated ggplot2 resources including tutorials, packages, books, courses, galleries, related repos, and people to follow. Was this email forwarded to you? Sign up here >> |
Older messages
Data Elixir - Issue 433
Tuesday, April 25, 2023
ML design patterns. Analysis with SQLite and Python. A/B testing resources. Performant tidy code. How to run surveys.
Data Elixir - Issue 432
Wednesday, April 19, 2023
The Data Delusion. Time series analysis. Equalized Odds in ML. Making decisions with data. How to build reproducible pipelines with R.
Data Elixir - Issue 431
Tuesday, April 11, 2023
Testing analytics code. Polars for initial data analysis. State of AI in 14 charts. R games. Data viz with ChatGPT.
Data Elixir - Issue 430
Tuesday, March 28, 2023
Data wrangling essentials. How to find hidden APIs. Data validation. A/B testing with GPT. Measuring color. Beginner's guide to databases.
Data Elixir - Issue 429
Tuesday, March 21, 2023
SQL Tutor 🤖. Structured text tools. Intro to Central Limit Theorem. Bayesian Decision Analysis. Web scraping with R. Jupyter maps.
You Might Also Like
Import AI 399: 1,000 samples to make a reasoning model; DeepSeek proliferation; Apple's self-driving car simulator
Friday, February 14, 2025
What came before the golem? ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
Defining Your Paranoia Level: Navigating Change Without the Overkill
Friday, February 14, 2025
We've all been there: trying to learn something new, only to find our old habits holding us back. We discussed today how our gut feelings about solving problems can sometimes be our own worst enemy
5 ways AI can help with taxes 🪄
Friday, February 14, 2025
Remotely control an iPhone; 💸 50+ early Presidents' Day deals -- ZDNET ZDNET Tech Today - US February 10, 2025 5 ways AI can help you with your taxes (and what not to use it for) 5 ways AI can help
Recurring Automations + Secret Updates
Friday, February 14, 2025
Smarter automations, better templates, and hidden updates to explore 👀 ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
The First Provable AI-Proof Game: Introducing Butterfly Wings 4
Friday, February 14, 2025
Top Tech Content sent at Noon! Boost Your Article on HackerNoon for $159.99! Read this email in your browser How are you, @newsletterest1? undefined The Market Today #01 Instagram (Meta) 714.52 -0.32%
GCP Newsletter #437
Friday, February 14, 2025
Welcome to issue #437 February 10th, 2025 News BigQuery Cloud Marketplace Official Blog Partners BigQuery datasets now available on Google Cloud Marketplace - Google Cloud Marketplace now offers
Charted | The 1%'s Share of U.S. Wealth Over Time (1989-2024) 💰
Friday, February 14, 2025
Discover how the share of US wealth held by the top 1% has evolved from 1989 to 2024 in this infographic. View Online | Subscribe | Download Our App Download our app to see thousands of new charts from
The Great Social Media Diaspora & Tapestry is here
Friday, February 14, 2025
Apple introduces new app called 'Apple Invites', The Iconfactory launches Tapestry, beyond the traditional portfolio, and more in this week's issue of Creativerly. Creativerly The Great
Daily Coding Problem: Problem #1689 [Medium]
Friday, February 14, 2025
Daily Coding Problem Good morning! Here's your coding interview problem for today. This problem was asked by Google. Given a linked list, sort it in O(n log n) time and constant space. For example,
📧 Stop Conflating CQRS and MediatR
Friday, February 14, 2025
Stop Conflating CQRS and MediatR Read on: my website / Read time: 4 minutes The .NET Weekly is brought to you by: Step right up to the Generative AI Use Cases Repository! See how MongoDB powers your