Data Elixir - Issue 433
ISSUE 433 · April 25, 2023

In the News

Inside the secret list of websites that make AI like ChatGPT sound smart
In spite of their rapid popularity, the details of how chatbots are trained are mostly hidden. In this article, The Washington Post dives into a dataset of 15 million websites that were used to train some high-profile AIs to reveal the range of inputs that are shaping what AIs "know."

Tutorials & Opinions

More Design Patterns For Machine Learning Systems
Design patterns don't just make it easier to write good code. They also communicate the problem being addressed and how the code or component is intended to be used. In this continuation of a series, Eugene Yan explores common design patterns that are used in industry to build machine learning systems.

Data analysis with SQLite and Python
In a workshop at PyCon last week, Simon Willison presented a three-hour tutorial on data analysis using SQLite and Python. Here's the 9-page handout, covering the basics of using the sqlite3 module, sqlite-utils, Datasette and even a bit on Datasette Lite.

Writing performant code with tidy tools
When computational efficiency is the priority, switching from functions in dplyr and tidyr to the backend tools underlying them can result in substantial speedups.

A data analyst workflow, part 1: SQL & tidyverse
Nice tutorial that shows how a data analyst can use either SQL or the tidyverse for the initial stages of data exploration and then double down on tidyverse with ggplot2 for a deeper exploration.

Resources

Recommended Resources for Starting A/B Testing
There are a lot of resources for A/B testing, but where do you start? In this post, Emily Robinson and Eddie Wharton have curated some of the best resources on the web to help teams design, implement, and analyze A/B tests effectively.

How to Run Surveys
Great guide on effectively running surveys for data collection. It covers things like sample selection, survey design, data collection and data analysis. It also highlights considerations to make sure your results are accurate, reliable, and useful.

Data Visualization

Visually Accessible Data Visualization
Creating data visualizations that are accessible to people with common types of color blindness will enable more people to understand your visualizations and products. In this post, Derek Torsani walks through the challenges of creating accessible charts and shows the strategies used by his team at Plaid.
Older messages
Data Elixir - Issue 432
Wednesday, April 19, 2023
The Data Delusion. Time series analysis. Equalized Odds in ML. Making decisions with data. How to build reproducible pipelines with R.
Data Elixir - Issue 431
Tuesday, April 11, 2023
Testing analytics code. Polars for initial data analysis. State of AI in 14 charts. R games. Data viz with ChatGPT.
Data Elixir - Issue 430
Tuesday, March 28, 2023
Data wrangling essentials. How to find hidden APIs. Data validation. A/B testing with GPT. Measuring color. Beginner's guide to databases.
Data Elixir - Issue 429
Tuesday, March 21, 2023
SQL Tutor 🤖. Structured text tools. Intro to Central Limit Theorem. Bayesian Decision Analysis. Web scraping with R. Jupyter maps.
Data Elixir - Issue 428
Tuesday, March 14, 2023
Gradient descent in SQL. Applied ML. Geographic data science with R. Algorithmic trading in python. Competitive ML.