Data Elixir - Data Elixir - Issue 388
ISSUE 388 · May 24, 2022In the NewsUsing ML to Help Protect the Great Barrier ReefIn spite of the costs, machine learning has been successfully used in a variety of conservation projects around the world. Here's an inside look at how the Great Barrier Reef Foundation leveraged the latest technologies to survey, monitor and map reefs at scale. OrganizationsDon’t just run your data team like a product team, run it like a company that needs to scaleData teams are always under-resourced, but simultaneously can be seen as an already expensive investment. Here are some ideas for getting the support your data team needs. Sponsored LinkHow to Capture Advantages by Investing in High-Quality Training DataAt the enterprise level, machine learning requires either large amounts of training data or a smaller set of extremely high quality data, as well as the infrastructure to support high data volumes. Consequently, labeling data through robust software or in partnership with an annotation service provider is critical to project success. Read more. Tutorials, Projects & OpinionsHow random forests really workIn this notebook tutorial, Jeremy Howard from fast.ai shows how Random Forests work, by building one from scratch, and then using it to submit to a Kaggle competition. Visualizing multicollinearity in PythonMulticollinearity is when two or more features are correlated with each other in a dataset and it's important to identify and understand it prior to training predictive models. This post explores three ways to visualize multicollinearity, including pros/cons of each. MarginaliaIn the world of statistics, “marginal” means “additional,” or what happens to outcome variable y when explanatory variable x changes a little. This isn't short but it's a gentle introduction to all things marginal and how they work: marginal effects, marginal slopes, average marginal effects, marginal effects at the mean, and more. Unlock Secret Knowledge from Data Experts for $10Packt's Spring Sale is on and for a limited period, all eBooks and Videos are only $10. Our Products are available as PDF, ePub, and MP4 files for you to download and keep forever. All the practical content you need - by developers for developers. ResourcesSoftware Development Resources for Data ScientistsGreat collection of resources that will help data teams create reproducible and production-ready code and tools. This is a crowd-sourced collection covering project structure, automatated testing, reproducible environments, and version control. Mathematics for Machine LearningThis is a tightly curated collection of free books, videos, and papers for learning mathematics for machine learning. Covers all levels. Code & ToolsLineaPyLineaPy is a Python package for data scientists that makes it easy to go from prototype to production. Just add two lines of code and LineaPy will automatically capture, analyze, and transform messy data science code to production data pipelines. No refactoring or new tools needed. NannyMLNannyML is an open-source python library that estimates real-world model performance (without access to targets), detects data drift, and links data drift alerts to changes in model performance. It's easy to use, model-agnostic and supports all tabular binary classification use cases. Obsidian DataviewDataview is a data index and query language over Markdown files. It's designed as an Obsidian plugin and will give you superpowers with your Obsidian Vaults. If you're not familiar with it, Obsidian is a free graph knowledge base that works on top of a local folder of Markdown files and is great for things like note taking, book development, ideation,
etc. Was this email forwarded to you? Sign up here >> |
Older messages
Data Elixir - Issue 387
Tuesday, May 17, 2022
Supervised clustering. Bandits for recommender systems. JavaScript for R. Teaching data science at scale.
Data Elixir - Issue 386
Tuesday, May 10, 2022
Trusting your data. How to protect your models. How to hire for DS roles. Horizon charts.
Data Elixir - Issue 385
Tuesday, May 3, 2022
Making data actionable. Using BIG AI models in a startup. From academia to industry. ML validity. Mental models for visualization.
Data Elixir - Issue 384
Tuesday, April 26, 2022
Data tests. Null Island. Confidence intervals for ML classifiers. Containers for ML. Performance utilities for regression modeling.
Data Elixir - Issue 383
Tuesday, April 19, 2022
Data teams: embedded or centralized? Unskilled and unaware of it. Counterfactual evaluation. Quant UX vs data science.
You Might Also Like
Import AI 399: 1,000 samples to make a reasoning model; DeepSeek proliferation; Apple's self-driving car simulator
Friday, February 14, 2025
What came before the golem? ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
Defining Your Paranoia Level: Navigating Change Without the Overkill
Friday, February 14, 2025
We've all been there: trying to learn something new, only to find our old habits holding us back. We discussed today how our gut feelings about solving problems can sometimes be our own worst enemy
5 ways AI can help with taxes 🪄
Friday, February 14, 2025
Remotely control an iPhone; 💸 50+ early Presidents' Day deals -- ZDNET ZDNET Tech Today - US February 10, 2025 5 ways AI can help you with your taxes (and what not to use it for) 5 ways AI can help
Recurring Automations + Secret Updates
Friday, February 14, 2025
Smarter automations, better templates, and hidden updates to explore 👀 ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
The First Provable AI-Proof Game: Introducing Butterfly Wings 4
Friday, February 14, 2025
Top Tech Content sent at Noon! Boost Your Article on HackerNoon for $159.99! Read this email in your browser How are you, @newsletterest1? undefined The Market Today #01 Instagram (Meta) 714.52 -0.32%
GCP Newsletter #437
Friday, February 14, 2025
Welcome to issue #437 February 10th, 2025 News BigQuery Cloud Marketplace Official Blog Partners BigQuery datasets now available on Google Cloud Marketplace - Google Cloud Marketplace now offers
Charted | The 1%'s Share of U.S. Wealth Over Time (1989-2024) 💰
Friday, February 14, 2025
Discover how the share of US wealth held by the top 1% has evolved from 1989 to 2024 in this infographic. View Online | Subscribe | Download Our App Download our app to see thousands of new charts from
The Great Social Media Diaspora & Tapestry is here
Friday, February 14, 2025
Apple introduces new app called 'Apple Invites', The Iconfactory launches Tapestry, beyond the traditional portfolio, and more in this week's issue of Creativerly. Creativerly The Great
Daily Coding Problem: Problem #1689 [Medium]
Friday, February 14, 2025
Daily Coding Problem Good morning! Here's your coding interview problem for today. This problem was asked by Google. Given a linked list, sort it in O(n log n) time and constant space. For example,
📧 Stop Conflating CQRS and MediatR
Friday, February 14, 2025
Stop Conflating CQRS and MediatR Read on: my website / Read time: 4 minutes The .NET Weekly is brought to you by: Step right up to the Generative AI Use Cases Repository! See how MongoDB powers your