Data Elixir - Data Elixir - Issue 404
ISSUE 404 · September 13, 2022InsightOrganizations need to deliberately create dataPeople sometimes say that "data is the new oil," but that line of thinking confines models to the data that's available for extraction. A better approach is to figure out what data you need and then figure out how to create it. This is a great post that explores the limitations of extracted data and how teams gain by deliberately creating the data they need. Takeaways from Gartner Data & Analytics SummitNice overview of highlights and four big ideas from the recent Gartner Data & Analytics Summit. Sponsored LinkRegister: TechCrunch x iMerit ML DataOps SummitJoin 2,000+ data scientists, engineers, and ML Professionals virtually at the iMerit ML DataOps Summit to hear from leaders at the forefront of deploying ML DataOps solutions that power machine learning and artificial intelligence. Register for free. Tutorials, Projects & Opinions5 questions to categorize machine learning interpretability approachesAfter reading hundreds of papers and writing a book on machine learning interpretation, Christoph Molnar has identified some useful categories of interpretation techniques. In this post, he organizes his thinking into five simple questions that will help you assess the ML interpretation approaches that are suitable for different use-cases. Getting Started with Apache Arrow in RNice collection of R resources, cheatsheets, and a tutorial for using Apache Arrow to work with data that's larger than memory. This is aimed at experienced R users who are new to Arrow. Want a data science project?This is the best take I've seen on the recently released treasure trove of hospital pricing data. Over 100TB of data was released and by all accounts, it's a mess. In this post, Randy Au explores what's available, what needs to happen next and how, ultimately, the lack of tools and structure has more to do with real-world data handling issues than maliciousness. There are important problems and opportunities here. Djinn by Tonic.ai - AI-driven synthetic data modelsWhether it's privacy controls or a lack of high quality data slowing you down, Djinn's AI-driven synthetic data models create private and augmented data within minutes of setup. Answer nuanced scientific questions, optimize business processes, and make better decisions. Code & ToolsPySearch: Python Function Search by DescriptionPySearch is a free search engine for querying python libraries using natural language descriptions. Just select the libraries you want to search and then use natural language or keywords to describe what you're looking for. Check out the examples to see it in action. Data VisualizationMapping wind data with RGreat R tutorial that shows how to access, reshape and visualize wind data as streamlines. This is a step-by-step tutorial that includes code and links to key resources along the way. Which fonts to use for your charts and tablesSans-serif or serif typefaces? Lining or oldstyle figures? Narrow or wide? With lots of examples, this post explains which fonts work best for various types of data visualizations. CareerThe Difficult Life of the Data LeadAs data teams get bigger, more Data Leads are needed but Data Leads have one of the hardest roles in data. They have to manage a team, work with stakeholders and still stay hands-on. In this post, Mikkel Dengsøe explores the challenges and ideas for making the role better. Join the Data Elixir Talent CollectiveThe Data Elixir Talent Collective is a reverse job board where top companies apply to you. Choose to be anonymous or public and get matched with opportunities that fit your specific interests. This is a free resource but membership is limited. To apply, you need 3+ years experience in data science, analytics, machine learning, visualization, or a related field. For more info, APPLY HERE. If you’re hiring, apply now to find top candidates faster, sourced from the Data Elixir community. We're creating the highest signal-to-noise hiring resource for roles in the data ecosystem. Already, there are more than 100 mid to senior level candidates from a wide variety of organizations; from fast moving startups to big companies, like Google, Amazon, Apple, NVIDIA and more. If you're hiring, APPLY HERE. Was this email forwarded to you? Sign up here >> |
Older messages
Data Elixir - Issue 403
Tuesday, September 6, 2022
Semantic layers: a deep dive. Communicating A/B test results. Using an attribution framework. Bayesian age-period-cohort models in Python.
Data Elixir - Issue 402
Tuesday, August 30, 2022
Modeling and analytics for ⚽. Practical causal forecasting. Intro to data contracts. Expressive analytics w/ Python.
Data Elixir - Issue 401
Tuesday, August 23, 2022
Homegrown auth w/ ML. Intro to backprop. Key-value DBs. GPT-3 for science. Data product canvas. R-spatial ecosystem.
Data Elixir - Issue 400
Tuesday, August 16, 2022
Deep dive into SVD. Smart paywalls. Idea to funding. Bayesian inference at scale. Logistic regression explainer.
Data Elixir - Issue 399
Tuesday, August 9, 2022
The 8 slide resume. Intro to streaming for data scientists. Random Forest explainer.
You Might Also Like
Import AI 399: 1,000 samples to make a reasoning model; DeepSeek proliferation; Apple's self-driving car simulator
Friday, February 14, 2025
What came before the golem? ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
Defining Your Paranoia Level: Navigating Change Without the Overkill
Friday, February 14, 2025
We've all been there: trying to learn something new, only to find our old habits holding us back. We discussed today how our gut feelings about solving problems can sometimes be our own worst enemy
5 ways AI can help with taxes 🪄
Friday, February 14, 2025
Remotely control an iPhone; 💸 50+ early Presidents' Day deals -- ZDNET ZDNET Tech Today - US February 10, 2025 5 ways AI can help you with your taxes (and what not to use it for) 5 ways AI can help
Recurring Automations + Secret Updates
Friday, February 14, 2025
Smarter automations, better templates, and hidden updates to explore 👀 ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
The First Provable AI-Proof Game: Introducing Butterfly Wings 4
Friday, February 14, 2025
Top Tech Content sent at Noon! Boost Your Article on HackerNoon for $159.99! Read this email in your browser How are you, @newsletterest1? undefined The Market Today #01 Instagram (Meta) 714.52 -0.32%
GCP Newsletter #437
Friday, February 14, 2025
Welcome to issue #437 February 10th, 2025 News BigQuery Cloud Marketplace Official Blog Partners BigQuery datasets now available on Google Cloud Marketplace - Google Cloud Marketplace now offers
Charted | The 1%'s Share of U.S. Wealth Over Time (1989-2024) 💰
Friday, February 14, 2025
Discover how the share of US wealth held by the top 1% has evolved from 1989 to 2024 in this infographic. View Online | Subscribe | Download Our App Download our app to see thousands of new charts from
The Great Social Media Diaspora & Tapestry is here
Friday, February 14, 2025
Apple introduces new app called 'Apple Invites', The Iconfactory launches Tapestry, beyond the traditional portfolio, and more in this week's issue of Creativerly. Creativerly The Great
Daily Coding Problem: Problem #1689 [Medium]
Friday, February 14, 2025
Daily Coding Problem Good morning! Here's your coding interview problem for today. This problem was asked by Google. Given a linked list, sort it in O(n log n) time and constant space. For example,
📧 Stop Conflating CQRS and MediatR
Friday, February 14, 2025
Stop Conflating CQRS and MediatR Read on: my website / Read time: 4 minutes The .NET Weekly is brought to you by: Step right up to the Generative AI Use Cases Repository! See how MongoDB powers your