Data Elixir - Issue 374
ISSUE 374 · February 15, 2022

In the News

ML Becomes a Mathematical Collaborator
Mathematicians often work together when they're searching for insight into a hard problem. It's a kind of a freewheeling collaborative process that seems to require a uniquely human touch. But in two new results, the role of human collaborator has been replaced in part by a machine...

Sponsored Link

Delivering Accurate Ground Truth Data for AI/ML Models
30+ years experience working with leading data-centric AI/ML models. With 3,500+ global SMEs and experience with any data type, we accelerate operations and advance models in record time. Scale your AI with your model's new secret weapon, Innodata.

Tutorials, Projects & Opinions

Data Distribution Shifts and Monitoring
Deploying a model to production isn't the end of the process because the model's performance will degrade over time. In this easy-to-follow deep dive, Chip Huyen explores the issues, including data distribution shifts, monitoring, and typical causes of ML failures. This post is intended for a machine learning systems design course at Stanford.

Privacy-preserving insurance quotes
Concrete Numpy is an open-source python package that compiles various numpy functions into their Fully Homomorphic Encryption (FHE) equivalents. In other words, Concrete Numpy allows models to work with sensitive data while encrypted. This tutorial walks through basic concepts of how Concrete Numpy works and how to use it.

Faster Python calculations with Numba
If you're writing array-oriented Python code that uses For loops, it doesn't help that NumPy is fast because the For loops are in Python, so it's slow. Here's how numba can get you a 13x speed increase with just two lines of code.

Read: The Best Tools for ETL in 2022
Learn to efficiently integrate your data sources and get to analysis

Code & Tools

D-Tale
D-Tale is a visualization tool that makes it easy to view and analyze Pandas data structures. D-Tale supports a variety of pandas objects and it works seamlessly with Jupyter notebooks and python terminals. There's a lot of info here, including links to demos, tutorials and articles.

Ask HN: Tools to visualize data in SQL databases?
Nice discussion about the various tools that are available to visualize data in a SQL table. There are a variety of use-cases here and discussion of pros/cons for both off-the-shelf products and open-source options.

Resources

The Effect: An Intro to Research Design and Causality
This new book is a great introduction to design-based causal inference. The first half takes an intuitive approach to develop an understanding for research design. The second half is more technical and introduces a standard toolset for doing causal inference. The entire book is written in a conversational style that's easy to follow. Free to read online.

Data Visualization

This map went viral! Here's how to make it.
There's a lot of data presented in this state by state, stream graph representation of population data in the U.S. It's part of a bigger project that was done for a law firm and it's super effective. Here's a step-by-step tutorial showing how to make maps like this using R.

To find specific content from prior issues or to research topics, check out the searchable Archives on Data Elixir's Search Page.
Older messages
Data Elixir - Issue 373
Tuesday, February 8, 2022
How data businesses work. Salaries dropping. ML monitoring research challenges. Python setup for DS. State of Data Viz.
Data Elixir - Issue 372
Tuesday, February 1, 2022
Predicting experiments. 🟩🟩🟩🟩🟩. Intro to probabilistic programming. Future of the data warehouse. Bad stat critiques.
Data Elixir - Issue 371
Tuesday, January 25, 2022
Research highlights of 2021. Faster Python. Too much data? SQL alternatives. Mistakes included. AI warfare.
Data Elixir - Issue 370
Tuesday, January 18, 2022
ML: 2021 and beyond. State of ML in Julia. Bayesian Modeling w/ Python. Shiny databases. Lead scoring w/ logistic regression. Beautiful plotting in R.
Data Elixir - Issue 369
Tuesday, January 11, 2022
Tactical career planning. The data-to-engineer ratio. ML generalization. Spiral graphs. DS management: the first year. Interview prep guide.