Data Elixir - Data Elixir - Issue 388
ISSUE 388 · May 24, 2022In the NewsUsing ML to Help Protect the Great Barrier ReefIn spite of the costs, machine learning has been successfully used in a variety of conservation projects around the world. Here's an inside look at how the Great Barrier Reef Foundation leveraged the latest technologies to survey, monitor and map reefs at scale. OrganizationsDon’t just run your data team like a product team, run it like a company that needs to scaleData teams are always under-resourced, but simultaneously can be seen as an already expensive investment. Here are some ideas for getting the support your data team needs. Sponsored LinkHow to Capture Advantages by Investing in High-Quality Training DataAt the enterprise level, machine learning requires either large amounts of training data or a smaller set of extremely high quality data, as well as the infrastructure to support high data volumes. Consequently, labeling data through robust software or in partnership with an annotation service provider is critical to project success. Read more. Tutorials, Projects & OpinionsHow random forests really workIn this notebook tutorial, Jeremy Howard from fast.ai shows how Random Forests work, by building one from scratch, and then using it to submit to a Kaggle competition. Visualizing multicollinearity in PythonMulticollinearity is when two or more features are correlated with each other in a dataset and it's important to identify and understand it prior to training predictive models. This post explores three ways to visualize multicollinearity, including pros/cons of each. MarginaliaIn the world of statistics, “marginal” means “additional,” or what happens to outcome variable y when explanatory variable x changes a little. This isn't short but it's a gentle introduction to all things marginal and how they work: marginal effects, marginal slopes, average marginal effects, marginal effects at the mean, and more. Unlock Secret Knowledge from Data Experts for $10Packt's Spring Sale is on and for a limited period, all eBooks and Videos are only $10. Our Products are available as PDF, ePub, and MP4 files for you to download and keep forever. All the practical content you need - by developers for developers. ResourcesSoftware Development Resources for Data ScientistsGreat collection of resources that will help data teams create reproducible and production-ready code and tools. This is a crowd-sourced collection covering project structure, automatated testing, reproducible environments, and version control. Mathematics for Machine LearningThis is a tightly curated collection of free books, videos, and papers for learning mathematics for machine learning. Covers all levels. Code & ToolsLineaPyLineaPy is a Python package for data scientists that makes it easy to go from prototype to production. Just add two lines of code and LineaPy will automatically capture, analyze, and transform messy data science code to production data pipelines. No refactoring or new tools needed. NannyMLNannyML is an open-source python library that estimates real-world model performance (without access to targets), detects data drift, and links data drift alerts to changes in model performance. It's easy to use, model-agnostic and supports all tabular binary classification use cases. Obsidian DataviewDataview is a data index and query language over Markdown files. It's designed as an Obsidian plugin and will give you superpowers with your Obsidian Vaults. If you're not familiar with it, Obsidian is a free graph knowledge base that works on top of a local folder of Markdown files and is great for things like note taking, book development, ideation,
etc. Was this email forwarded to you? Sign up here >> |
Older messages
Data Elixir - Issue 387
Tuesday, May 17, 2022
Supervised clustering. Bandits for recommender systems. JavaScript for R. Teaching data science at scale.
Data Elixir - Issue 386
Tuesday, May 10, 2022
Trusting your data. How to protect your models. How to hire for DS roles. Horizon charts.
Data Elixir - Issue 385
Tuesday, May 3, 2022
Making data actionable. Using BIG AI models in a startup. From academia to industry. ML validity. Mental models for visualization.
Data Elixir - Issue 384
Tuesday, April 26, 2022
Data tests. Null Island. Confidence intervals for ML classifiers. Containers for ML. Performance utilities for regression modeling.
Data Elixir - Issue 383
Tuesday, April 19, 2022
Data teams: embedded or centralized? Unskilled and unaware of it. Counterfactual evaluation. Quant UX vs data science.
You Might Also Like
WP Weekly 226 - Launches - New Elementor Theme, WP 6.8 in April 2025, Automattic Scale Back
Monday, January 13, 2025
Read on Website WP Weekly 226 / Launches 2025 has just started, and there is a slew of new launches like Hello Biz Theme, Meta Box Lite, FooConvert, Affililink, and more. Also, the next WordPress 6.8
SRE Weekly Issue #459
Monday, January 13, 2025
View on sreweekly.com A message from our sponsor, incident.io: Effective incident management demands coordination and collaboration to minimize disruptions. This guide by incident.io covers the full
Saving One Screen At A Time 🖥️
Monday, January 13, 2025
Why the screen saver stopped being so in-your-face. Here's a version for your browser. Hunting for the end of the long tail • January 12, 2025 Today in Tedium: Having seen a lot of pipes, wavy
Software Testing Weekly - Issue 253
Monday, January 13, 2025
Software Testing Weekly turns 5! 🥳 View on the Web Archives ISSUE 253 January 13th 2025 COMMENT Welcome to the 253rd issue! Oh my, time flies! It's hard to believe this week marks 5 years since I
CES 2025 - Sync #501
Sunday, January 12, 2025
Plus: Sam Altman reflects on the last two years; Anthropic reportedly in talks to raise $2B at $60B valuation; e-tattoo decodes brainwaves; anthrobots; top 25 biotech companies for 2025; and more! ͏ ͏
PD#608 Mistakes engineers make in large established codebases
Sunday, January 12, 2025
You can't practice it beforehand ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
C#539 A detailed look at EF Core’s JSON Columns feature
Sunday, January 12, 2025
Comparing it with the traditional tables with indexes
RD#488 How to avoid issues with custom Hooks
Sunday, January 12, 2025
Using them carelessly can lead to many problems
Daily Coding Problem: Problem #1666 [Easy]
Sunday, January 12, 2025
Daily Coding Problem Good morning! Here's your coding interview problem for today. This problem was asked by Amazon. Given n numbers, find the greatest common denominator between them. For example,
🛜 Here's What Happens to Old Websites — Features the Pixel Should Copy From Samsung's One UI 7
Sunday, January 12, 2025
Also: What Instagram Needs to Compete With TikTok, and More! How-To Geek Logo January 12, 2025 Did You Know Mount Wingen, located near Wingen, New South Wales in Australia, is better known as Burning