Data Elixir - Issue 395
ISSUE 395 · July 12, 2022

Trends

Data teams are getting larger, faster
Data teams are getting bigger, faster, but something happens when a team grows past 10 people. You no longer know if the data you use is reliable, the lineage is too large to make sense of, and end-users start complaining about data issues every other day. In his latest post, Mikkel Dengsøe explores the issues and ways to handle them as a team scales.

In the News

How likely is it that the audits of Comey and McCabe were a coincidence? A statistical exploration.
It's not often that a news source will teach statistics to expose a story. But without the statistics, you're just left with accusations that could be easy to brand as fake news. How likely is it that the audits were a coincidence? The answer is harder to determine than you might think.

Sponsored Link

DoubleCloud: make the most of your data in the cloud
Collect, aggregate and migrate data from multiple sources to build sub-second analytics on fully managed data stacks with proven open-source technologies. Perfect solution for unlocking the potential in your data. Start your trial today

Tutorials, Projects & Opinions

Success Metrics for Product Analytics
Great post on success metrics and how ultimately, they're not a replacement for strategy. They're a way to confirm that the strategy was executed successfully.

4 Pandas Anti-Patterns to Avoid and How to Fix Them
pandas users often learn suboptimal coding practices that become their default workflows. This post highlights four common pandas anti-patterns and outlines a set of techniques that you should use instead.

MLOps: Overview, Definition, and Architecture
Great overview of Machine Learning Operations (MLOps) based on a mixed-method research approach, including a literature review, a tool review, and expert interviews. This is a wide-ranging paper that covers MLOps principles, components, roles, architecture, and workflows. There's a lot that's still being figured out in MLOps but for anyone who might think MLOps is simple, check out Figure 4!

Resources

Python for Data Analysis, 3rd Edition
The third edition of Wes McKinney's Python for Data Analysis has just been released and is free to read online. This is a practical, hands-on guide for manipulating, processing, cleaning, and analyzing datasets with Python. This new edition brings the content up-to-date from 2017.

Data in Wonderland
Storytelling can enhance others' understanding of data, especially when combined with analysis and visualization. In this online book, Scott Spencer shows how, covering everything from writing style to visualization & interactives to how to give a good presentation.

Data Visualization

Career Portraits in Data Visualization
A lot of people are involved in producing data visualizations. Among analysts, designers, developers, and engineers, who does what, exactly? And what are their backgrounds? This is a great deep dive, based on the Data Visualization Society's "State of the Industry" survey.

Multi-scale model assessment with spatialsample
Nice tutorial that shows how (and why!) to use the new {spatialsample} rstats package to model spatially structured data.

Outlier

Using GPT-3 to explain how code works
Awesome use-case for GPT-3. Could GPT-3 really explain how sections of code work? Check this out. "It’s shockingly effective."