Data Elixir - Data Elixir - Issue 376
ISSUE 376 · March 1, 2022In the NewsIf you've been watching the news and wondering how a data/tech person can best help the situation in Ukraine, this Twitter discussion that Andrew Therriault started is a good place to start. Andrew is a data scientist and the founder of the data-for-good organization called Civin. InsightWhy becoming a data-driven organization is so hardA new survey of executives explores the challenges that many organizations face when trying to become "data-driven." In this article, Randy Bean considers the results from the survey and distills key steps that organizations can take to succeed. For a detailed report, follow the link to the "NewVantage Partners annual survey." Sponsored Link30-Day Trial of Innodata’s New Data Annotation PlatformThis new web-based annotation platform reduces the cost of AI/ML projects while enabling users to develop more accurate models. With easy-to-use workflows, customizable workbenches, real-time KPIs, and auto-annotation capabilities, your team can create high-quality training data on-prem or on the cloud. Sign up for a free 30-day trial today! Tutorials, Projects & OpinionsNotebooks In Production With MetaflowNotebooks in production?! Here's how — and why — new Metaflow features enable you to orchestrate notebook execution in your DAGs and visualize/debug production workflows. Learning with limited data - part 2: Active LearningHere's part 2 of what to do when faced with a limited amount of labeled data for supervised learning tasks. When the labeling budget is limited or the labeling cost is high, active learning is useful for selecting the most valuable samples to label next. Web Scraping with RThis step-by-step tutorial shows how to scrape web pages using R, starting with simple web scraping tasks and then continuing with scraping multiple pages. It's Time to Rethink Your Media DietFounded by investment bankers, The Daily Upside is our favorite source for premium business news that’s actually worth reading. The Daily Upside delivers crisp insights on market-moving stories and teases out nuances you won't read elsewhere. It's completely free, and the sole purpose is to make you a sharp, well-informed
investor. Code & ToolsStatistical ⚡️ ForecastThis new package is said to be the fastest yet autoarima implementation for Python. It's claimed to be 20x faster than pmdarima and 500x faster than MetaAI's Prophet. Includes a collection of widely used univariate time series forecasting models, including exponential smoothing and automatic ARIMA modeling. imodelsInterpretability is important in high-stakes fields like medicine and political science but complicated models are increasingly difficult to interpret. imodels is a new Python package for concise, transparent, and accurate predictive modeling. ResourcesProbabilistic Machine Learning: Advanced TopicsKevin Murphy just released the Advanced Topics edition of his popular Probabilistic Machine Learning series. It's not quite done but it's close and is free to download. The text covers Bayesian inference, causality, reinforcement learning, distribution shift, deep generative models, etc. For the other books in this series, see the pml-book website >> Data VisualizationColors for all!This new R package makes it easy to find the color palettes that will work best for your particular data and use-case. Just select a few parameters that describe your data and cols4all will score a variety of palettes on aspects that will make your visualizations more effective. To find specific content from prior issues or to research topics, check out the searchable Archives on Data Elixir's Search Page >> |
Older messages
Data Elixir - Issue 375
Tuesday, February 22, 2022
Data diff algorithms. Unbundling the data platform. Changing jobs? Watch for these 🚩🚩. Interactive canvas for Jupyter. Finding missing evidence.
Data Elixir - Issue 374
Tuesday, February 15, 2022
Intro to design-based causal inference. Easy EDA for Pandas. Modeling with encrypted data. Data distribution shifts.
Data Elixir - Issue 373
Tuesday, February 8, 2022
How data businesses work. Salaries dropping. ML monitoring research challenges. Python setup for DS. State of Data Viz.
Data Elixir - Issue 372
Tuesday, February 1, 2022
Predicting experiments. 🟩🟩🟩🟩🟩. Intro to probabilistic programming. Future of the data warehouse. Bad stat critiques.
Data Elixir - Issue 371
Tuesday, January 25, 2022
Research highlights of 2021. Faster Python. Too much data? SQL alternatives. Mistakes included. AI warfare.
You Might Also Like
How to avoid spam texts
Tuesday, January 14, 2025
Let me ask you something: How many times have you shared your phone number online this month? Every time you do—whether for a delivery, online shopping, or signing up for a new service—you're
BetterDev #273 - Operating System in 1,000 Lines
Monday, January 13, 2025
Better Dev #273 Jan 12, 2025 Hi all, Happy new year. Welcome to the first issue of 2025. I'm trying to become more regular this year. Looking forward to a new year and hope everyone continue to
Daily Coding Problem: Problem #1667 [Hard]
Monday, January 13, 2025
Daily Coding Problem Good morning! Here's your coding interview problem for today. This problem was asked by Airbnb. We're given a hashmap associating each courseId key with a list of courseIds
🧠 Are Supercomputers Dead? — This 90s Tech Is Perfect for Smart TVs
Monday, January 13, 2025
Also: How to Make Sense of Linux Ping Stats, and More! How-To Geek Logo January 13, 2025 Did You Know The original name of the iconic SR-71 Blackbird was actually the RS-71 Blackbird, but Lyndon
Consistency means nothing & Bluesky is reportedly valued at $700
Monday, January 13, 2025
Sill Beta Update #3, Miro AI starts storing AI interactions from free users, Mastodon transfers to a new non-profit organization, and a lot more in this week's issue of Creativerly. Creativerly
Ranked | The AI Models With the Lowest Hallucination Rates 🤖
Monday, January 13, 2025
Hallucination rate is the frequency that an LLM generates false or unsupported information in its outputs. Which models have the lowest rates? View Online | Subscribe | Download Our App FEATURED STORY
GCP Newsletter #433
Monday, January 13, 2025
Welcome to issue #433 January 13th, 2025 News Official Blog Vertex AI Introducing Vertex AI RAG Engine: Scale your Vertex AI RAG pipeline with confidence - Vertex AI RAG Engine is a fully managed
Spyglass Dispatch: It's Political & Personal
Monday, January 13, 2025
On Meta's Moderation Changes • Inside DOGE • Zuck Slams Apple (Again) • Apple's Muted 2025 • CES 2025 Recap The Spyglass Dispatch is a newsletter sent on weekdays featuring links and commentary
$200 to invest today... (USA Only)
Monday, January 13, 2025
Join me in investing in blue chip art on Masterworks, and you will receive $200 to invest on the platform. Not kidding. Founder interview coming soon! ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
The Sequence Knowledge #468: A New Series About RAG
Monday, January 13, 2025
Exploring key concepts of one of the most popular methods in generative AI solutions. ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏