Data Elixir - Data Elixir - Issue 374
ISSUE 374 · February 15, 2022In the NewsML Becomes a Mathematical CollaboratorMathematicians often work together when they’re searching for insight into a hard problem. It’s a kind of a freewheeling collaborative process that seems to require a uniquely human touch. But in two new results, the role of human collaborator has been replaced in part by a machine... Sponsored LinkDelivering Accurate Ground Truth Data for AI/ML Models30+ years experience working with leading data-centric AI/ML models. With 3,500+ global SMEs and experience with any data type, we accelerate operations and advance models in record time. Scale your AI with your model's new secret weapon, Innodata. Tutorials, Projects & OpinionsData Distribution Shifts and MonitoringDeploying a model to production isn't the end of the process because the model's performance will degrade over time. In this easy-to-follow deep dive, Chip Huyen explores the issues, including data distribution shifts, monitoring, and typical causes of ML failures. This post is intended for a machine learning systems design course at Stanford. Privacy-preserving insurance quotesConcrete Numpy is an open-source python package that compiles various numpy functions into their Fully Homomorphic Encryption (FHE) equivalents. In other words, Concrete Numpy allows models to work with sensitive data while encrypted. This tutorial walks through basic concepts of how Concrete Numpy works and how to use it. Faster Python calculations with NumbaIf you're writing array-oriented Python code that uses For loops, it doesn't help that NumPy is fast because the For loops are in Python, so it's slow. Here's how numba can get you a 13x speed increase with just two lines of code. Read: The Best Tools for ETL in 2022Learn to efficiently integrate your data sources and get to analysis Code & ToolsD-TaleD-Tale is a visualization tool that makes it easy to view and analyze Pandas data structures. D-Tale supports a variety of pandas objects and it works seamlessly with Jupyter notebooks and python terminals. There's a lot of info here, including links to demos, tutorials and articles. Ask HN: Tools to visualize data in SQL databases?Nice discussion about the various tools that are available to visualize data in a SQL table. There are a variety of use-cases here and discussion of pros/cons for both off-the-shelf products and open-source options. ResourcesThe Effect: An Intro to Research Design and CausalityThis new book is a great introduction to design-based causal inference. The first half takes an intuitive approach to develop an understanding for research design. The second half is more technical and introduces a standard toolset for doing causal inference. The entire book is written in a conversational style that's easy to follow. Free to read online. Data VisualizationThis map went viral! Here's how to make it.There's a lot of data presented in this state by state, stream graph representation of population data in the U.S. It's part of a bigger project that was done for a law firm and it's super effective. Here's a step-by-step tutorial showing how to make maps like this using R. To find specific content from prior issues or to research topics, check out the searchable Archives on Data Elixir's Search Page >> |
Older messages
Data Elixir - Issue 373
Tuesday, February 8, 2022
How data businesses work. Salaries dropping. ML monitoring research challenges. Python setup for DS. State of Data Viz.
Data Elixir - Issue 372
Tuesday, February 1, 2022
Predicting experiments. 🟩🟩🟩🟩🟩. Intro to probabilistic programming. Future of the data warehouse. Bad stat critiques.
Data Elixir - Issue 371
Tuesday, January 25, 2022
Research highlights of 2021. Faster Python. Too much data? SQL alternatives. Mistakes included. AI warfare.
Data Elixir - Issue 370
Tuesday, January 18, 2022
ML: 2021 and beyond. State of ML in Julia. Bayesian Modeling w/ Python. Shiny databases. Lead scoring w/ logistic regression. Beautiful plotting in R.
Data Elixir - Issue 369
Tuesday, January 11, 2022
Tactical career planning. The data-to-engineer ratio. ML generalization. Spiral graphs. DS management: the first year. Interview prep guide.
You Might Also Like
Import AI 399: 1,000 samples to make a reasoning model; DeepSeek proliferation; Apple's self-driving car simulator
Friday, February 14, 2025
What came before the golem? ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
Defining Your Paranoia Level: Navigating Change Without the Overkill
Friday, February 14, 2025
We've all been there: trying to learn something new, only to find our old habits holding us back. We discussed today how our gut feelings about solving problems can sometimes be our own worst enemy
5 ways AI can help with taxes 🪄
Friday, February 14, 2025
Remotely control an iPhone; 💸 50+ early Presidents' Day deals -- ZDNET ZDNET Tech Today - US February 10, 2025 5 ways AI can help you with your taxes (and what not to use it for) 5 ways AI can help
Recurring Automations + Secret Updates
Friday, February 14, 2025
Smarter automations, better templates, and hidden updates to explore 👀 ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
The First Provable AI-Proof Game: Introducing Butterfly Wings 4
Friday, February 14, 2025
Top Tech Content sent at Noon! Boost Your Article on HackerNoon for $159.99! Read this email in your browser How are you, @newsletterest1? undefined The Market Today #01 Instagram (Meta) 714.52 -0.32%
GCP Newsletter #437
Friday, February 14, 2025
Welcome to issue #437 February 10th, 2025 News BigQuery Cloud Marketplace Official Blog Partners BigQuery datasets now available on Google Cloud Marketplace - Google Cloud Marketplace now offers
Charted | The 1%'s Share of U.S. Wealth Over Time (1989-2024) 💰
Friday, February 14, 2025
Discover how the share of US wealth held by the top 1% has evolved from 1989 to 2024 in this infographic. View Online | Subscribe | Download Our App Download our app to see thousands of new charts from
The Great Social Media Diaspora & Tapestry is here
Friday, February 14, 2025
Apple introduces new app called 'Apple Invites', The Iconfactory launches Tapestry, beyond the traditional portfolio, and more in this week's issue of Creativerly. Creativerly The Great
Daily Coding Problem: Problem #1689 [Medium]
Friday, February 14, 2025
Daily Coding Problem Good morning! Here's your coding interview problem for today. This problem was asked by Google. Given a linked list, sort it in O(n log n) time and constant space. For example,
📧 Stop Conflating CQRS and MediatR
Friday, February 14, 2025
Stop Conflating CQRS and MediatR Read on: my website / Read time: 4 minutes The .NET Weekly is brought to you by: Step right up to the Generative AI Use Cases Repository! See how MongoDB powers your