Data Elixir - Issue 403
ISSUE 403 · September 6, 2022

Join the Data Elixir Talent Collective
The Data Elixir Talent Collective is a reverse job board where top companies apply to you. Members control all communication, so you won't get the noise that's typical on other recruiting channels. Choose to be anonymous or public and get matched with opportunities that fit your specific interests, whether that's a career move, more pay, remote work, etc. The Collective is off to a great start with 75 members in its first week. This is a free resource, but membership is limited to professionals with 3+ years of experience in data science, analytics, ML, visualization, and related fields. The plan is to create the highest signal-to-noise hiring resource for roles in the data ecosystem. For more info, apply here.

Insights

Down the Semantic Rabbit Hole
The purpose of a semantic layer is to close the gap between the “business language” and the “data language” and offer a unified and consistent view of the business. This deep dive into the topic is one of the more complete explainers I’ve come across.

Be Good-argument-Driven, Not Data-driven
When you begin to favor *bad* arguments that involve data over *good* arguments that don’t, you’ll notice things start to go awry. Richard Marmorstein offers a reminder for us data folks to consider the foundational elements of the argument itself before leaning on metrics.

Tutorials, Projects & Opinions

Communicating A/B Test Results with Ratios and Uncertainty Intervals
If you work with A/B tests frequently, then the question of how to communicate uncertainty has likely come to mind. This argument for presenting A/B test results for conversion rates as ratios, with uncertainty intervals around those ratios, makes a lot of sense to me.

How Instacart Uses Machine Learning-Driven Autocomplete to Help People Fill Their Carts
Search is a big deal for eCommerce platforms like Instacart. This post describes how the team generates and ranks query suggestions in autocomplete and how this shapes a user’s search behavior, which translates into larger basket sizes.

Measuring Downstream Impact on Social Networks by Using an Attribution Framework
Quantifying the impact of a given action in a social network is hard because that action has secondary effects on other members. Here, LinkedIn introduces a practical framework for running experiments in these types of environments.

Bayesian Age/Period/Cohort Models in Python with PyMC
PyMC can be a powerful tool in the right hands. Austin Rochford does a good job of showcasing this through Bayesian APC models, including lots of code snippets to get you ready to tackle the inferential challenges these models pose.

Creating a Real-Time Feature Store with Materialize
Why is everyone talking about feature stores, yet so few are (successfully) implementing them? In this post, we walk you through how Materialize can be used to create a real-time feature store for a fraud detection use case.

Code & Tools

R Shiny Now Available in Python
For the uninitiated, Shiny is a package that makes it easy to build interactive web applications and dashboards. It was previously limited to the R programming language, but the creators of Shiny have recently announced Shiny for Python as well!

DocQuery: Document Query Engine
DocQuery is a Python library that makes it easy to extract information from documents. It works with semi-structured and unstructured documents, such as PDFs, scanned images, etc. Simply point DocQuery at one or more documents and specify a question you want to ask.

Data Visualization

The Magic of Matplotlib Stylesheets
The “out of the box” visualizations in Matplotlib aren’t fantastic. Luckily, stylesheets can level up your plots in a very accessible manner. This overview takes you from fairly uninspiring visualizations to something much more compelling.
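
As a small illustration of the A/B testing item in this issue, here is a minimal sketch of reporting a conversion-rate result as a ratio with an uncertainty interval. It assumes a standard normal approximation on the log of the ratio, which is one common approach and not necessarily the method used in the linked article. The function name and the counts are hypothetical and made up for the example.

```python
import math
from statistics import NormalDist


def conversion_ratio_interval(conv_a, n_a, conv_b, n_b, level=0.95):
    """Return (ratio, lower, upper) for rate_b / rate_a.

    Uses a normal approximation on log(ratio); this is an assumed,
    textbook-style interval, not the linked article's exact method.
    """
    if min(conv_a, conv_b) <= 0 or conv_a > n_a or conv_b > n_b:
        raise ValueError("need 0 < conversions <= visitors in both groups")

    rate_a = conv_a / n_a
    rate_b = conv_b / n_b
    ratio = rate_b / rate_a

    # Standard error of log(ratio) for two independent binomial samples.
    se_log = math.sqrt(1 / conv_b - 1 / n_b + 1 / conv_a - 1 / n_a)

    # Two-sided critical value for the requested confidence level.
    z = NormalDist().inv_cdf(0.5 + level / 2)

    lower = math.exp(math.log(ratio) - z * se_log)
    upper = math.exp(math.log(ratio) + z * se_log)
    return ratio, lower, upper


# Hypothetical counts: control (A) and variant (B), 10,000 visitors each.
ratio, lower, upper = conversion_ratio_interval(200, 10_000, 230, 10_000)
print(
    f"Variant converts at {ratio:.2f}x the control rate "
    f"(95% interval: {lower:.2f}x to {upper:.2f}x)"
)
```

With these made-up counts the interval spans 1.0, so the same sentence that reports the ratio also signals that the result is inconclusive.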