Data Elixir - Data Elixir - Issue 404
ISSUE 404 · September 13, 2022InsightOrganizations need to deliberately create dataPeople sometimes say that "data is the new oil," but that line of thinking confines models to the data that's available for extraction. A better approach is to figure out what data you need and then figure out how to create it. This is a great post that explores the limitations of extracted data and how teams gain by deliberately creating the data they need. Takeaways from Gartner Data & Analytics SummitNice overview of highlights and four big ideas from the recent Gartner Data & Analytics Summit. Sponsored LinkRegister: TechCrunch x iMerit ML DataOps SummitJoin 2,000+ data scientists, engineers, and ML Professionals virtually at the iMerit ML DataOps Summit to hear from leaders at the forefront of deploying ML DataOps solutions that power machine learning and artificial intelligence. Register for free. Tutorials, Projects & Opinions5 questions to categorize machine learning interpretability approachesAfter reading hundreds of papers and writing a book on machine learning interpretation, Christoph Molnar has identified some useful categories of interpretation techniques. In this post, he organizes his thinking into five simple questions that will help you assess the ML interpretation approaches that are suitable for different use-cases. Getting Started with Apache Arrow in RNice collection of R resources, cheatsheets, and a tutorial for using Apache Arrow to work with data that's larger than memory. This is aimed at experienced R users who are new to Arrow. Want a data science project?This is the best take I've seen on the recently released treasure trove of hospital pricing data. Over 100TB of data was released and by all accounts, it's a mess. In this post, Randy Au explores what's available, what needs to happen next and how, ultimately, the lack of tools and structure has more to do with real-world data handling issues than maliciousness. There are important problems and opportunities here. Djinn by Tonic.ai - AI-driven synthetic data modelsWhether it's privacy controls or a lack of high quality data slowing you down, Djinn's AI-driven synthetic data models create private and augmented data within minutes of setup. Answer nuanced scientific questions, optimize business processes, and make better decisions. Code & ToolsPySearch: Python Function Search by DescriptionPySearch is a free search engine for querying python libraries using natural language descriptions. Just select the libraries you want to search and then use natural language or keywords to describe what you're looking for. Check out the examples to see it in action. Data VisualizationMapping wind data with RGreat R tutorial that shows how to access, reshape and visualize wind data as streamlines. This is a step-by-step tutorial that includes code and links to key resources along the way. Which fonts to use for your charts and tablesSans-serif or serif typefaces? Lining or oldstyle figures? Narrow or wide? With lots of examples, this post explains which fonts work best for various types of data visualizations. CareerThe Difficult Life of the Data LeadAs data teams get bigger, more Data Leads are needed but Data Leads have one of the hardest roles in data. They have to manage a team, work with stakeholders and still stay hands-on. In this post, Mikkel Dengsøe explores the challenges and ideas for making the role better. Join the Data Elixir Talent CollectiveThe Data Elixir Talent Collective is a reverse job board where top companies apply to you. Choose to be anonymous or public and get matched with opportunities that fit your specific interests. This is a free resource but membership is limited. To apply, you need 3+ years experience in data science, analytics, machine learning, visualization, or a related field. For more info, APPLY HERE. If you’re hiring, apply now to find top candidates faster, sourced from the Data Elixir community. We're creating the highest signal-to-noise hiring resource for roles in the data ecosystem. Already, there are more than 100 mid to senior level candidates from a wide variety of organizations; from fast moving startups to big companies, like Google, Amazon, Apple, NVIDIA and more. If you're hiring, APPLY HERE. Was this email forwarded to you? Sign up here >> |
Older messages
Data Elixir - Issue 403
Tuesday, September 6, 2022
Semantic layers: a deep dive. Communicating A/B test results. Using an attribution framework. Bayesian age-period-cohort models in Python.
Data Elixir - Issue 402
Tuesday, August 30, 2022
Modeling and analytics for ⚽. Practical causal forecasting. Intro to data contracts. Expressive analytics w/ Python.
Data Elixir - Issue 401
Tuesday, August 23, 2022
Homegrown auth w/ ML. Intro to backprop. Key-value DBs. GPT-3 for science. Data product canvas. R-spatial ecosystem.
Data Elixir - Issue 400
Tuesday, August 16, 2022
Deep dive into SVD. Smart paywalls. Idea to funding. Bayesian inference at scale. Logistic regression explainer.
Data Elixir - Issue 399
Tuesday, August 9, 2022
The 8 slide resume. Intro to streaming for data scientists. Random Forest explainer.
You Might Also Like
💻 Installing Linux on an Old Laptop Instead of a Raspberry Pi — Flagship Phones Need More Storage
Monday, November 18, 2024
Also: I Built the Perfect Programming Platform In Less Than 10 Minutes, and More! How-To Geek Logo November 18, 2024 Did You Know The Sixth Sense was the highest-grossing horror film of all time in
Daily Coding Problem: Problem #1612 [Hard]
Monday, November 18, 2024
Daily Coding Problem Good morning! Here's your coding interview problem for today. This problem was asked by Etsy. Given a sorted array, convert it into a height-balanced binary search tree.
10,000 ways to fail & The European Search Perspective
Monday, November 18, 2024
Reflecting on over five years of Creativerly, Signal introduces Call Links, the science of mental models, and a lot more in this week's issue of Creativerly. Creativerly 10000 ways to fail &
Charted | Global GHG Emissions, by Sector 🌎
Monday, November 18, 2024
In this graphic, we show greenhouse gas emissions by sector in 2023. View Online | Subscribe | Download Our App Presented by: New 3-Part Series: Bitcoin Demystified >> Learn more about one of the
Spyglass Dispatch: Samsung/Google Smart Glasses • Star Wars Mess • Netflix Knocked Out • Conan's Oscars • MicroStrategy's Comeback • Vision Pro In Focus • Saving 'Inside the NBA' • Apple Television Lives!
Monday, November 18, 2024
Samsung/Google Smart Glasses • Star Wars Mess • Netflix Knocked Out • Conan's Oscars • MicroStrategy's Comeback • Vision Pro In Focus • Saving 'Inside the NBA' • Apple Television Lives!
GCP Newsletter #424
Monday, November 18, 2024
Welcome to issue #425 November 18th, 2024 News Google Kubernetes Engine Official Blog 65000 nodes and counting: Google Kubernetes Engine is ready for trillion-parameter AI models - Google Kubernetes
Design and code beautiful products. Together.
Monday, November 18, 2024
Pablo Ruiz-Múzquiz and the team at Penpot have recently announced a new plugin feature that allows users to build new tools and functionalities on the platform. ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
Can Bitcoin Put an End to Forever War?
Monday, November 18, 2024
Top Tech Content sent at Noon! How the world collects web data Read this email in your browser How are you, @newsletterest1? 🪐 What's happening in tech today, November 18, 2024? The HackerNoon
25 tips for programming with AI
Monday, November 18, 2024
Meta Quest dominates Steam VR; Stop squirting hot glue into devices -- ZDNET ZDNET Tech Today - US November 18, 2024 digitalspeed-gettyimages-1322205545 25 AI tips to boost your programming
Ordering, Grouping and Consistency in Messaging systems
Monday, November 18, 2024
We went quite far from our Queue Broker series in recent editions, but today, we're back to it! By powers combined, I joined our Queue Broker implementation to solve the generic idempotency check