Data Elixir - Issue 371
ISSUE 371 · January 25, 2022

Interested in editing/writing for Data Elixir?
I'm looking for a couple of guest curators to help pull things together for one or more issues in 2022. There are a lot of ways it could work. If you're interested, answer a few quick questions and I'll be in touch.

Insight

Good Data Citizenship Doesn’t Work
More data isn't necessarily better. The key is managing it in a way that's useful and doesn't distract you from reaching your goals. This post uses lots of examples to explore the issues and the lessons learned along the way.

The Rise of A.I. Fighter Pilots
Artificial intelligence is being taught to fly warplanes. That's not really surprising, but how can anyone trust it?

Sponsored Link

Free Course: Natural Language Processing (NLP) for Semantic Search
Learn how to build semantic search applications! This free course from Pinecone covers everything you need to build state-of-the-art language models for semantic search. Start reading→

Tutorials, Projects & Opinions

ML and NLP Research Highlights of 2021
There were a lot of advances in machine learning and natural language processing last year. This post doesn't cover everything, but it's a great collection of interesting highlights across multiple impactful areas. Each section includes a summary of the highlight, why it's important, links to key papers, diagrams, and thoughts about what's next.

My Machine Learning Process (Mistakes Included)
Writers generally clean out any mistakes in their posts, which can make it seem that everything always works out perfectly. Not this one. This tutorial shows how to train a machine learning model in the real world, "with all the mistakes and fruitless efforts included."

How vectorization speeds up your Python code
Python has a lot going for it, but it's not the fastest. This is a great post that shows how to process a large amount of homogeneous data quickly in Python using vectorization. Learn what that means, when it applies, and how to do it.
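To give a rough idea of what vectorization looks like in practice, here is a minimal sketch. It is not taken from the linked post; it assumes NumPy is installed, and the function names `sum_of_squares_loop` and `sum_of_squares_vectorized` are hypothetical, chosen only for this illustration. Both functions compute the same sum of squares over an array of homogeneous floats: one with an explicit Python loop, the other with a single NumPy call that does the work in compiled code.

```python
import numpy as np


def sum_of_squares_loop(values):
    """Plain Python loop: one interpreter step per element."""
    total = 0.0
    for v in values:
        total += v * v
    return total


def sum_of_squares_vectorized(values):
    """Vectorized version: the whole array is processed in one NumPy call."""
    arr = np.asarray(values, dtype=np.float64)
    # The dot product of an array with itself is the sum of its squares.
    return float(np.dot(arr, arr))


# Illustrative data only: the numbers 0.0 through 999.0.
data = np.arange(1_000, dtype=np.float64)
loop_result = sum_of_squares_loop(data)
vec_result = sum_of_squares_vectorized(data)
```

On large arrays the vectorized version is typically much faster, because the per-element work no longer passes through the Python interpreter. The exact speedup depends on the data size and the machine.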
Binding Apache Arrow to R
Apache Arrow is an open-source library that simplifies working with flat and hierarchical data. This is a fun and gentle introduction to the inner workings of Apache Arrow and how to use it from within R.

How Hasura improved conversion by 20% with PostHog
In 2021 Hasura’s Engineering and UX teams started self-hosting PostHog to collect product insights without needing to share user data with third parties. Using PostHog’s funnel analysis and session recording tools, Hasura’s team was able to improve conversion by 20% overnight.

Code & Tools

PRQL - Pipelined Relational Query Language
PRQL is a SQL alternative that compiles to SQL and works anywhere SQL does. Like SQL, it's readable, explicit and declarative but unlike SQL, it forms a logical pipeline of transformations, and supports abstractions such as variables and functions. For a lively discussion on the project, check out this post on Hacker News >>

tinygp - the tiniest of Gaussian Process libraries
tinygp is an extremely lightweight library for building Gaussian Process (GP) models in Python, built on top of jax. It has a nice interface, it’s pretty fast and, thanks to jax, tinygp supports things like GPU acceleration and automatic differentiation.

Resources

Python for Data Analysis:
Older messages
Data Elixir - Issue 370
Tuesday, January 18, 2022
ML: 2021 and beyond. State of ML in Julia. Bayesian Modeling w/ Python. Shiny databases. Lead scoring w/ logistic regression. Beautiful plotting in R.
Data Elixir - Issue 369
Tuesday, January 11, 2022
Tactical career planning. The data-to-engineer ratio. ML generalization. Spiral graphs. DS management: the first year. Interview prep guide.
Data Elixir - Issue 368
Tuesday, January 4, 2022
Top notebooks of 2021. ⚽ Analytics Review. On testing. Real-time ML. ML YouTube.
Data Elixir - Issue 367
Tuesday, December 21, 2021
Top Python libraries in 2021. Essential visualization. Jupyter games. Life of an ML dataset. Data versioning.
Data Elixir - Issue 366
Tuesday, December 14, 2021
Big data paradox. Data "scientists"...? ML playgrounds. Data serialisation in R. Building models like open-source software.