Data Elixir - Data Elixir - Issue 402
ISSUE 402 · August 30, 2022Introducing the Data Elixir Talent CollectiveThe Data Elixir Talent Collective is a reverse job board where top companies apply to you. Members control all communication, so you won't get the noise that's typical on other recruiting channels. Choose to be anonymous or public and get matched with opportunities that fit your specific interests; be it a career move, more pay, remote work, etc. This is a free resource but membership is limited to professionals with 3+ years experience in data science, analytics, ML, visualization, and related fields. The plan is to create the highest signal-to-noise hiring resource for roles in the data ecosystem. For more info, apply here >> Tutorials, Projects & OpinionsCausal Forecasting at Lyft - Part 2In this second post of a series, Sameer Manek and Duane Rich describe the practical side of causal forecasting at Lyft. Specifically, they take a look at the software and algorithms that Lyft uses to bridge the gap between causal theory and implementation. An astronomer's introduction to NumPyroNice introduction to NumPyro, including examples that show how to use NumPyro to do Bayesian inference. The Rise of Data ContractsData Contracts are API-like agreements between engineers who own services and data consumers, like analysts and data scientists, who understand how the business works and have specific data needs in order to generate well-modeled and dependable products. This is a nice intro to data contracts, the problems they solve and how to use them. Real-time data quality tests using dbt and MaterializeHave you ever encountered a data quality issue where you’ve said "Wow, I wish I could have found out about this issue later.” No? Using Materialize as your backend alongside dbt, you can monitor your data pipelines and get notified in real-time when data quality expectations fail. Check out how in this post. Resources⚽ Soccermatics: mathematical modeling of footballDavid Sumpter's new course will show you how to understand the game of football using mathematics, statistics and machine learning. Guest lectures from world-leading practitioners and researchers help make it one of the most comprehensive ⚽ analytics courses anywhere. And if you're not looking for credits, the lessons, videos, and code are all free. Code & ToolsIbis: Expressive analytics in Python at any scaleIbis is a Python framework that lets you access data and perform analytical computations from different sources, in a standard way. You can think of Ibis as writing SQL in Python, but with a focus on analytics. And, aside from SQL databases, you can use it with other data stores, such as Apache Impala, BigQuery, Dask, DuckDB, and more. Data VisualizationHow to visualize a billion rows of data in R with Apache ArrowIn case you ever need to visualize a staggering amount of data, this tutorial by Danielle Navarro shows how to use Arrow to help plot data from a billion-plus taxi rides in New York. This is a practical post that walks through the thinking process for handling a larger-than-RAM dataset and producing a crisp and useful visualization using all the data. HoloViz TutorialHoloViz is an open-source visualization ecosystem that makes it easy to use Python visualization libraries with your data. This is a great, multi-part tutorial that introduces its capabilities and shows how to build and deploy interactive dashboards, how to use HoloViz for big, streaming, and multidimensional data, and how to build interactive exploratory tools without also needing to run a web-tech software project. OutlierStable Diffusion: the most important AI art model yetThere's been a lot of excitement on the Internet this week about the newly released Stable Diffusion. It's an open-source text-to-image diffusion model and considering the quality of the images bouncing around the web this week, it's a very big deal. This post is a great overview and see the release announcement for
links and tech details. Was this email forwarded to you? Sign up here >> |
Older messages
Data Elixir - Issue 401
Tuesday, August 23, 2022
Homegrown auth w/ ML. Intro to backprop. Key-value DBs. GPT-3 for science. Data product canvas. R-spatial ecosystem.
Data Elixir - Issue 400
Tuesday, August 16, 2022
Deep dive into SVD. Smart paywalls. Idea to funding. Bayesian inference at scale. Logistic regression explainer.
Data Elixir - Issue 399
Tuesday, August 9, 2022
The 8 slide resume. Intro to streaming for data scientists. Random Forest explainer.
Data Elixir - Issue 398
Tuesday, August 2, 2022
Building modern data teams. Art From Code. Jupyter for code development & publishing. DS guide to statistical genetics. Nuanced metrics.
Data Elixir - Issue 397
Tuesday, July 26, 2022
Guide to sports analytics. Tensor Puzzles 🧩. Betting on data. DL for tabular data. Results vs Accuracy.
You Might Also Like
Saving One Screen At A Time 🖥️
Monday, January 13, 2025
Why the screen saver stopped being so in-your-face. Here's a version for your browser. Hunting for the end of the long tail • January 12, 2025 Today in Tedium: Having seen a lot of pipes, wavy
Software Testing Weekly - Issue 253
Monday, January 13, 2025
Software Testing Weekly turns 5! 🥳 View on the Web Archives ISSUE 253 January 13th 2025 COMMENT Welcome to the 253rd issue! Oh my, time flies! It's hard to believe this week marks 5 years since I
CES 2025 - Sync #501
Sunday, January 12, 2025
Plus: Sam Altman reflects on the last two years; Anthropic reportedly in talks to raise $2B at $60B valuation; e-tattoo decodes brainwaves; anthrobots; top 25 biotech companies for 2025; and more! ͏ ͏
PD#608 Mistakes engineers make in large established codebases
Sunday, January 12, 2025
You can't practice it beforehand ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
C#539 A detailed look at EF Core’s JSON Columns feature
Sunday, January 12, 2025
Comparing it with the traditional tables with indexes
RD#488 How to avoid issues with custom Hooks
Sunday, January 12, 2025
Using them carelessly can lead to many problems
Daily Coding Problem: Problem #1666 [Easy]
Sunday, January 12, 2025
Daily Coding Problem Good morning! Here's your coding interview problem for today. This problem was asked by Amazon. Given n numbers, find the greatest common denominator between them. For example,
🛜 Here's What Happens to Old Websites — Features the Pixel Should Copy From Samsung's One UI 7
Sunday, January 12, 2025
Also: What Instagram Needs to Compete With TikTok, and More! How-To Geek Logo January 12, 2025 Did You Know Mount Wingen, located near Wingen, New South Wales in Australia, is better known as Burning
☁️ Azure Weekly #498 - 12th January 2025
Sunday, January 12, 2025
Festive Tech Calendar 2024 recap, GitHub Copilot Bootcamp, and Dev Containers FTW!
Sunday Digest | Featuring 'The Income Needed to Join the Top 1% in Every U.S. State' 📊
Sunday, January 12, 2025
Every visualization published this week, in one place. Jan 12, 2025 | View Online | Subscribe | VC+ | Download Our App Hello, welcome to your Sunday Digest. This week, we visualized the value of the