Data Elixir - Issue 413
ISSUE 413 · November 15, 2022

Note that Data Elixir is taking next week off and will be back in your Inbox in two weeks. If you're in the U.S., have a great Thanksgiving!

Insights

Data’s day of reckoning
Data is sometimes said to be the "new oil" but for most businesses, that analogy doesn't work. In the most successful of data organizations, data might burn bright but, as Benn Stancil puts it here, in most businesses, data burns more like peat moss. Here are some reasons why, with some ideas for all the median businesses.

Sponsored Link

Web scraping datasets made easy - ScrapFly.io
The web is full of quality data, though scraping it can be difficult. ScrapFly API can retrieve any web page or simplify the web scraping process through cloud web browsers: click buttons, input forms, and retrieve the data. ScrapFly comes with a Python SDK, making scraping in notebooks a breeze. Try ScrapFly for free!

Tutorials, Projects & Opinions

How Federated Learning Protects Privacy
Most machine learning models are trained by collecting vast amounts of data on a central server. This is a great visual explainer that shows how federated learning makes it possible to train models without any user's raw data leaving their device.

Forecasting with Structural AR Timeseries
The strength of a Bayesian model is largely the flexibility it offers for different modeling tasks. In this tutorial, Nathaniel Forde shows how to fit a range of auto-regressive structural timeseries models and how to use them to predict future observations.

Method Chaining in Pandas: Bad Form Or a Recipe For Success?
Matt Harrison has written books on pandas and Python and regularly trains data science teams at top companies. And yet, his code is sometimes met with derision online. In this interview, he explores his approach to code, how to think about method chaining, and what separates naive code from good code.

Using Functional Analysis to Model Air Pollution Data
Functional analysis is one approach to understand how your data changes within a given timeframe, such as a day, or between timeframes such as many days. This is an easy-to-follow tutorial that shows how to apply functional analysis to some messy air pollution data using R.

How I learn machine learning
In a rapidly evolving field like machine learning, you need to figure out what works for you to navigate the never-ending task of staying up to date. In her latest post, Vicki Boykis shares her own process, including lots of links and resources along the way.

Tools & Code

Debirdify
This is a great tool if you're looking for Mastodon accounts to follow and want something more nuanced than a haphazard list of user handles. Debirdify searches a specific Twitter user's Lists and/or Followed Accounts for associated Mastodon handles and returns a Mastodon-friendly csv file.

Resources

Advanced NLP - Carnegie Mellon 2022
Graham Neubig's "Advanced NLP" is one of the best resources you'll find for current state-of-the-art techniques and algorithms in modern NLP. Follow the links for the slides and an awesome collection of readings and resources. Go here for the lecture videos 👉

Career

Looking for Ambitious Machine Learning Engineers
Ratio is a revenue-generating startup that's looking for ambitious machine learning engineers to help automate a big part of the advertising space. The product is "like a self-driving car of marketing" and largely uses existing models from OpenAI. Remote OK.

New Opportunities
In addition to office-based positions around the world, Data Elixir's Job Board currently has 35+ listings for remote positions, including roles for data scientists, data analysts, researchers, data architects, machine learning engineers, and more. The roles cover a variety of job levels, from Junior to Senior.

If you're HIRING, join the Data Elixir Talent Collective and get regular drops of outstanding data practitioners and leaders who are open to new opportunities 👉

Data Visualization

Images by Daniel Coe / CC BY-NC-ND 2.0

Visualizing Rivers and Floodplains with USGS Data
Awesome tutorial that shows how to create visualizations of the flow of water through rivers and floodplains using publicly available USGS data and open source tools. Includes links to tools, data, key resources, and a gallery of stunning visualizations.

Galileo’s Telescopic Discoveries: |