Data Elixir - Data Elixir - Issue 375
ISSUE 375 · February 22, 2022InsightWhy we use Julia, 10 years laterTen years ago this month, the Julia language was introduced to the world. It's now used by hundreds of thousands of people and taught at universities around the world. If you're not a user already, this post will give you a sense of what it can do and where things are going. And if you are a user, there are a lot of good Twitter feeds here to follow. Sponsored LinkTrust your data at scale with CastorCastor is a collaborative and automated data catalog. It is designed for mass adoption within your company, regardless of data literacy. Deploy in 30 minutes. Explore data lineage. Trust your data. Tutorials, Projects & OpinionsData Diffs: Algorithms for explaining what changed in a datasetNice introduction to "explanation algorithms" and how asking "why?" at the SQL level can be a useful way to identify the changes in a dataset that resulted in a different outcome. Along with exploring approaches, this post introduces an open-source data-differ that, essentially, takes two SQL queries and explains what makes their results different. Lyft and urban mobilityFun post from Mark Huberty that combines ride data from Lyft with graph theory and clustering to learn about urban mobility. The Unbundling of AirflowIf the unbundling of Airflow means all the heavy lifting is done by separate tools, what's left behind? Get training data for ML in record timeDesigned by engineers for engineers, Toloka combines cutting-edge technologies with the power of the crowd to deliver high-performing data for Machine Learning projects in record time. Built-in quality control system provides superb data accuracy at scale. Code & Toolsipycanvas - Interactive Canvas in Jupyteripycanvas is a lightweight, fast and stable library that exposes the browser's Canvas API to IPython. In other words, this toolset makes it easy to draw simple primitives such as text, lines, polygons, arc, etc. directly from Python. For ideas, check out the examples. MitoMito is a spreadsheet that lives inside your JupyterLab notebooks. It allows you to edit Pandas dataframes like an Excel file, and generates Python code that corresponds to each of your edits. CareerRed Flags to Look Out for When Joining a Data TeamThinking about changing jobs? This is a nice collection of insights to help you steer clear of bad surprises. It's tailored for data teams but the ideas here generally apply to most tech roles. For more, check out the Twitter discussion that started it >> Data VisualizationHow to use fewer colors in your data visualizationsIf you've ever tried to decipher a chart with dozens of colors, you know that color isn't always a good thing. Here are 10 ways to use fewer colors in your charts while also making them more understandable. Increasing Flexibility & Robustness of Plots in ggplot2In this step-by-step tutorial, Meghan Hall shows how to make your ggplot2 visualizations more flexible and robust to accommodate data that's changing. OutlierForgotten BooksScholars that study medieval literature are faced with lots of missing evidence. Manuscripts degrade over time; libraries burn. And what's left is barely a sketch of what the scholars are interested in. But by borrowing statistical concepts from ecology, researchers are able to estimate the data that isn't there. This is an awesome project that explores the issues & approaches, with a new statistics package to boot. To find specific content from prior issues or to research topics, check out the searchable Archives on Data Elixir's Search Page >> |
Older messages
Data Elixir - Issue 374
Tuesday, February 15, 2022
Intro to design-based causal inference. Easy EDA for Pandas. Modeling with encrypted data. Data distribution shifts.
Data Elixir - Issue 373
Tuesday, February 8, 2022
How data businesses work. Salaries dropping. ML monitoring research challenges. Python setup for DS. State of Data Viz.
Data Elixir - Issue 372
Tuesday, February 1, 2022
Predicting experiments. 🟩🟩🟩🟩🟩. Intro to probabilistic programming. Future of the data warehouse. Bad stat critiques.
Data Elixir - Issue 371
Tuesday, January 25, 2022
Research highlights of 2021. Faster Python. Too much data? SQL alternatives. Mistakes included. AI warfare.
Data Elixir - Issue 370
Tuesday, January 18, 2022
ML: 2021 and beyond. State of ML in Julia. Bayesian Modeling w/ Python. Shiny databases. Lead scoring w/ logistic regression. Beautiful plotting in R.
You Might Also Like
New Alpine.js Sort plugin, Laravel 11.5, and more - №510
Sunday, April 28, 2024
Your Laravel week in review ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
This Week's Daily Tip Roundup
Sunday, April 28, 2024
Missed some of this week's tips? No problem. We've compiled all of them here in one convenient place for you to enjoy. Happy learning! iPhoneLife Logo View In Browser Your Tip of the Day is
DeveloPassion's Newsletter #164 - A Thousand Fans
Sunday, April 28, 2024
Edition 164 of my newsletter, discussing Knowledge Management, Knowledge Work, Zen Productivity, Personal Organization, and more! Sébastien Dubois DeveloPassion's Newsletter DeveloPassion's
Nobody Likes a Know-It-All: Smaller LLMs are Gaining Momentum
Sunday, April 28, 2024
Phi-3 and OpenELM, two major small model releases this week. ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
Retro Recomendo: Music
Sunday, April 28, 2024
Recomendo - issue #408 ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
Your Phone’s Other Number 📱
Saturday, April 27, 2024
Let's talk about your phone's IMEI number. Here's a version for your browser. Hunting for the end of the long tail • April 27, 2024 Today in Tedium: As you may know, Tedium is a blog and/or
🕹️ How to Play Retro Games for Free on iPhone — Why I Can't Live Without an eReader
Saturday, April 27, 2024
Also: Anker MagGo (Qi2) Power Bank Review, and More! How-To Geek Logo April 27, 2024 📩 Get expert reviews, the hottest deals, how-to's, breaking news, and more delivered directly to your inbox by
Weekend Reading — The Bob Ross of programming
Saturday, April 27, 2024
This week we use coffee tasting as our design practice, get as close to and as far away from the metal as possible, find an easier way to write documentation, discover why Google Search is getting so
Issue #538: All the Jam entries, Panthera 2, and Tristram
Saturday, April 27, 2024
Weekly newsletter about HTML5 Game Development. Is this email not displaying correctly? View it in your browser. Issue #538 - April 26th 2024 If you have anything you want to share with the HTML5 game
Daily Coding Problem: Problem #1424 [Easy]
Saturday, April 27, 2024
Daily Coding Problem Good morning! Here's your coding interview problem for today. This problem was asked by Microsoft. Implement a URL shortener with the following methods: shorten(url) , which