Data Elixir - Data Elixir - Issue 387
ISSUE 387 · May 17, 2022In the NewsDistilling the Laws of Physics From Raw DataResearchers say we’re on the cusp of “GoPro physics,” where a camera can point at an event and an algorithm can identify the underlying physics equation. If you've studied genetic algorithms, the concepts here will feel familiar but data science is taking it to a whole new level. OrganizationsLet's talk a bit about giving interviewsAdvice is readily available for doing well in data science interviews when you're the one being interviewed. But interviews are completely different on the other side of the table. This is an insightful post for how to approach interviews when you're the one doing the interviewing. Supercharge your Postgres for time-series dataTimescaleDB is the open-source relational database for time-series and analytics, built by developers for developers. It brings together the familiarity of PostgreSQL with speed and petabyte scale. Try it for free (no credit card required). Tutorials, Projects & OpinionsBandits for Recommender SystemsBandits make great recommenders when new items are continually added (e.g., news, tweets) or when there's low traffic. While regular recsys tends to be greedy, bandits model uncertainty and deliberately explore. This is a great introduction to bandits with a focus on examples from industry. Includes linked references throughout. Supervised Clustering: How to Use SHAP Values for Better Cluster AnalysisCluster analysis is a popular method for identifying subgroups within a population, but the results are often challenging to interpret and action. Supervised clustering leverages SHAP values to identify better-separated clusters using a more structured representation of the data. Here's how the technique works, with clear examples along the way. Using Pyodide to Teach Data Science at ScalePyodide makes it possible to install and run Python packages in a browser. Pandas Tutor visualizes pandas chained method calls, step-by-step. Together, it's a browser-based data science education tool that easily scales. This post explores the use-case, the possibilities, and what it took to get Pyodide and Pandas Tutor to work together. PostgresMLPostgresML is an end-to-end machine learning system that enables you to train models and make online predictions using only SQL. The goal is that anyone with a basic understanding of SQL should be able to build, deploy and maintain ML models in production. Transformers for Natural Language ProcessingDon’t get left out in the cold stuck on one transformer platform. Learn how to use the right transformer, platform, and techniques to solve complex NLP problems with Denis Rothman’s new book, Transformers for Natural Language Processing, 2nd Edition. Available now on Amazon and the Packt website. ResourcesJavaScript for RThis new book shows how R and JavaScript can work together and it's much more than just JavaScript code running alongside R. This book shows how R and JavaScript can actively interact with each other and do things that aren't possible otherwise. Free to read online. Data VisualizationSince the critiques have started to roll in...The New York Times published a grim data visualization over the weekend called "One Million Lost." It chronicles the million+ COVID deaths in the U.S. and it's a stunning portrayal of the pandemic. But is it effective? In this thread, Will Chase explores the challenges of visualizing death and what it accomplishes. Was this email forwarded to you? Sign up here >> |
Older messages
Data Elixir - Issue 386
Tuesday, May 10, 2022
Trusting your data. How to protect your models. How to hire for DS roles. Horizon charts.
Data Elixir - Issue 385
Tuesday, May 3, 2022
Making data actionable. Using BIG AI models in a startup. From academia to industry. ML validity. Mental models for visualization.
Data Elixir - Issue 384
Tuesday, April 26, 2022
Data tests. Null Island. Confidence intervals for ML classifiers. Containers for ML. Performance utilities for regression modeling.
Data Elixir - Issue 383
Tuesday, April 19, 2022
Data teams: embedded or centralized? Unskilled and unaware of it. Counterfactual evaluation. Quant UX vs data science.
Data Elixir - Issue 382
Tuesday, April 12, 2022
Quarto. ML notebook tutorials. Reproducible & trustworthy workflows. Real-world recommenders. Graph-based outlier detection.
You Might Also Like
WP Weekly 226 - Launches - New Elementor Theme, WP 6.8 in April 2025, Automattic Scale Back
Monday, January 13, 2025
Read on Website WP Weekly 226 / Launches 2025 has just started, and there is a slew of new launches like Hello Biz Theme, Meta Box Lite, FooConvert, Affililink, and more. Also, the next WordPress 6.8
SRE Weekly Issue #459
Monday, January 13, 2025
View on sreweekly.com A message from our sponsor, incident.io: Effective incident management demands coordination and collaboration to minimize disruptions. This guide by incident.io covers the full
Saving One Screen At A Time 🖥️
Monday, January 13, 2025
Why the screen saver stopped being so in-your-face. Here's a version for your browser. Hunting for the end of the long tail • January 12, 2025 Today in Tedium: Having seen a lot of pipes, wavy
Software Testing Weekly - Issue 253
Monday, January 13, 2025
Software Testing Weekly turns 5! 🥳 View on the Web Archives ISSUE 253 January 13th 2025 COMMENT Welcome to the 253rd issue! Oh my, time flies! It's hard to believe this week marks 5 years since I
CES 2025 - Sync #501
Sunday, January 12, 2025
Plus: Sam Altman reflects on the last two years; Anthropic reportedly in talks to raise $2B at $60B valuation; e-tattoo decodes brainwaves; anthrobots; top 25 biotech companies for 2025; and more! ͏ ͏
PD#608 Mistakes engineers make in large established codebases
Sunday, January 12, 2025
You can't practice it beforehand ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
C#539 A detailed look at EF Core’s JSON Columns feature
Sunday, January 12, 2025
Comparing it with the traditional tables with indexes
RD#488 How to avoid issues with custom Hooks
Sunday, January 12, 2025
Using them carelessly can lead to many problems
Daily Coding Problem: Problem #1666 [Easy]
Sunday, January 12, 2025
Daily Coding Problem Good morning! Here's your coding interview problem for today. This problem was asked by Amazon. Given n numbers, find the greatest common denominator between them. For example,
🛜 Here's What Happens to Old Websites — Features the Pixel Should Copy From Samsung's One UI 7
Sunday, January 12, 2025
Also: What Instagram Needs to Compete With TikTok, and More! How-To Geek Logo January 12, 2025 Did You Know Mount Wingen, located near Wingen, New South Wales in Australia, is better known as Burning