Data Elixir - Data Elixir - Issue 413
ISSUE 413 · November 15, 2022Note that Data Elixir is taking next week off and will be back in your Inbox in two weeks. If you're in the U.S., have a great Thanksgiving! InsightsData’s day of reckoningData is sometimes said to be the "new oil" but for most businesses, that analogy doesn't work. In the most successful of data organizations, data might burn bright but, as Benn Stancil puts it here, in most businesses, data burns more like peat moss. Here are some reasons why, with some ideas for all the median businesses. Sponsored LinkWeb scraping datasets made easy - ScrapFly.ioThe web is full of quality data though scraping it can be difficult. ScrapFly API can retrieve any web page or simplify the web scraping process through cloud web browsers - click buttons, input forms and retrieve the data. ScrapFly comes with a Python SDK making scraping in notebooks a breeze - Try ScrapFly for free! Tutorials, Projects & OpinionsHow Federated Learning Protects PrivacyMost machine learning models are trained by collecting vast amounts of data on a central server. This is a great visual explainer that shows how federated learning makes it possible to train models without any user's raw data leaving their device. Forecasting with Structural AR TimeseriesThe strength of a Bayesian model is largely the flexibility it offers for different modeling tasks. In this tutorial, Nathaniel Forde shows how to fit and predict a range of auto-regressive structural timeseries models and how to predict future observations of the models. Method Chaining in Pandas: Bad Form Or a Recipe For Success?Matt Harrison has written books on pandas and Python and regularly trains data science teams at top companies. And yet, his code is sometimes met with derision online. In this interview, he explores his approach to code, how to think about method chaining, and what separates naive code from good code. Using Functional Analysis to Model Air Pollution DataFunctional analysis is one approach to understand how your data changes within a given timeframe, such as a day, or between timeframes such as many days. This is an easy-to-follow tutorial that shows how to apply functional analysis to some messy air pollution data using R. How I learn machine learningIn a rapidly evolving field like machine learning, you need to figure out what works for you to navigate the never-ending task of staying up to date. In her latest post, Vicki Boykis shares her own process, including lots of links and resources along the way. Tools & CodeDebirdifyThis is a great tool if you're looking for Mastodon accounts to follow and want something more nuanced than a haphazard list of user handles. Debirdify searches a specific Twitter user's Lists and/or Followed Accounts for associated Mastodon handles and returns a Mastodon-friendly csv file. ResourcesAdvanced NLP - Carnegie Mellon 2022Graham Neubig's "Advanced NLP" is one of the best resources you'll find for current state-of-the-art techniques and algorithms in modern NLP. Follow the links for the slides and an awesome collection of readings and resources. Go here for the lecture videos 👉 CareerLooking for Ambitious Machine Learning EngineersRatio is a revenue-generating startup that's looking for ambitious machine learning engineers to help automate a big part of the advertising space. The product is "like a self-driving car of marketing" and largely uses existing models from OpenAI. Remote OK. New OpportunitiesIn addition to office-based positions around the world, Data Elixir's Job Board currently has 35+ listings for remote positions, including roles for data scientists, data analysts, researchers, data architects, machine learning engineers, and more. The roles cover a variety of job levels, from Junior to Senior. If you're HIRING, join the Data Elixir Talent Collective and get regular drops of outstanding data practitioners and leaders who are open to new opportunities 👉Data VisualizationImages by Daniel Coe / CC BY-NC-ND 2.0 / Links: Image 1 Image 2 Image 3 Visualizing Rivers and Floodplains with USGS DataAwesome tutorial that shows how to create visualizations of the flow of water through rivers and floodplains using publicly available USGS data and open source tools. Includes links to tools, data, key resources, and a gallery of stunning visualiztions. Galileo’s Telescopic Discoveries: |
Older messages
Data Elixir - Issue 412
Tuesday, November 8, 2022
Python+SQL: SpyQL. Forecasting principles and practice. A/B testing caveats and limitations. Bullet graphs. Simplifying MLOps.
Data Elixir - Issue 411
Tuesday, November 1, 2022
Bayesian structural timeseries. Building data dictionaries. Visualization w/ Python. Dashboard design patterns.
Data Elixir - Issue 410
Tuesday, October 25, 2022
Data Stack in a Box. Earth System Modeling. Guide to posterior predictions. DS interview book.
Data Elixir - Issue 409
Tuesday, October 18, 2022
Monetizing internal tools. Experimentation platform in a day. Quarto Q/A. Quote extraction w/ NLP. Visualizing spatial data w/ Python.
Data Elixir - Issue 408
Tuesday, October 11, 2022
State of AI 2022. Exploratory causal analysis. Data testing for Python. Chance encounters. RecSys 2022. Building platforms for DS.
You Might Also Like
💻 Installing Linux on an Old Laptop Instead of a Raspberry Pi — Flagship Phones Need More Storage
Monday, November 18, 2024
Also: I Built the Perfect Programming Platform In Less Than 10 Minutes, and More! How-To Geek Logo November 18, 2024 Did You Know The Sixth Sense was the highest-grossing horror film of all time in
Daily Coding Problem: Problem #1612 [Hard]
Monday, November 18, 2024
Daily Coding Problem Good morning! Here's your coding interview problem for today. This problem was asked by Etsy. Given a sorted array, convert it into a height-balanced binary search tree.
10,000 ways to fail & The European Search Perspective
Monday, November 18, 2024
Reflecting on over five years of Creativerly, Signal introduces Call Links, the science of mental models, and a lot more in this week's issue of Creativerly. Creativerly 10000 ways to fail &
Charted | Global GHG Emissions, by Sector 🌎
Monday, November 18, 2024
In this graphic, we show greenhouse gas emissions by sector in 2023. View Online | Subscribe | Download Our App Presented by: New 3-Part Series: Bitcoin Demystified >> Learn more about one of the
Spyglass Dispatch: Samsung/Google Smart Glasses • Star Wars Mess • Netflix Knocked Out • Conan's Oscars • MicroStrategy's Comeback • Vision Pro In Focus • Saving 'Inside the NBA' • Apple Television Lives!
Monday, November 18, 2024
Samsung/Google Smart Glasses • Star Wars Mess • Netflix Knocked Out • Conan's Oscars • MicroStrategy's Comeback • Vision Pro In Focus • Saving 'Inside the NBA' • Apple Television Lives!
GCP Newsletter #424
Monday, November 18, 2024
Welcome to issue #425 November 18th, 2024 News Google Kubernetes Engine Official Blog 65000 nodes and counting: Google Kubernetes Engine is ready for trillion-parameter AI models - Google Kubernetes
Design and code beautiful products. Together.
Monday, November 18, 2024
Pablo Ruiz-Múzquiz and the team at Penpot have recently announced a new plugin feature that allows users to build new tools and functionalities on the platform. ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
Can Bitcoin Put an End to Forever War?
Monday, November 18, 2024
Top Tech Content sent at Noon! How the world collects web data Read this email in your browser How are you, @newsletterest1? 🪐 What's happening in tech today, November 18, 2024? The HackerNoon
25 tips for programming with AI
Monday, November 18, 2024
Meta Quest dominates Steam VR; Stop squirting hot glue into devices -- ZDNET ZDNET Tech Today - US November 18, 2024 digitalspeed-gettyimages-1322205545 25 AI tips to boost your programming
Ordering, Grouping and Consistency in Messaging systems
Monday, November 18, 2024
We went quite far from our Queue Broker series in recent editions, but today, we're back to it! By powers combined, I joined our Queue Broker implementation to solve the generic idempotency check