Data Elixir - Data Elixir - Issue 393
ISSUE 393 · June 28, 2022TrendsThe evolution of machine learning infrastructureBudgets for data and machine learning are hitting all-time highs and lots of value is being created across the space. In this post from Bessemer Venture Partners, Bhavik Nagda and Sakib Dadi explore what’s been happening in the machine learning technology stack, the key trends driving innovation, and where the opportunities are now. Fair GameWeb scraping is widely used by industry and researchers and has been deemed to be legal by courts in the U.S. Even so, using scraped data in contexts that it wasn't intended is rife with problems. This is a great article about the complicated dance between privacy, research, and the industries that have evolved around scraped data. Sponsored LinkHow to Capture Advantages by Investing in High-Quality Training DataAt the enterprise level, machine learning requires either large amounts of training data or a smaller set of extremely high quality data, as well as the infrastructure to support high data volumes. Consequently, labeling data through robust software or in partnership with an annotation service provider is critical to project success. Read more. Tutorials, Projects & OpinionsThings You Should Know About DatabasesIt's surprisingly easy to work with databases and not really understand basic principles about how they work. In this primer, Mahdi Yusuf walks through two of the most important topics when working with relational database systems: indexes and transactions. One year as a solo dev building open-source data tools without fundingGreat, first-hand account of turning a side project into something big. Or, bigger, anyway. The reality is, going solo is complicated but, in spite of that, there's been a lot of interest in Phil Eaton's DataStation recently. This post covers the journey so far: from problem definition to software
development to VC interest, and 4000 stars on GitHub! Why I still recommend JuliaA few weeks ago, there were lengthy online discussions about bugs in the Julia ecosystem and why it shouldn't be used when correctness matters. In this counter-post, Rik Huijzer walks through the things that Julia gets right and why, even though Julia isn't perfect, he still recommends it. Includes links to key discussions. OpenBB TerminalOpenBB Terminal is an open-source, Python-based platform for investment research. It's inspired by Bloomberg Terminal but without the fees. It gives you access to massive amounts of real-time and historical data and since it's Python, you can use common data science libraries with it like Pandas, Numpy, Scipy, Jupyter, Pytorch, etc. Data Product in Changing Environments: Rethinking and Updating InvestmentsDatasets and tools that are built for specific people in an organization can easily be forgotten when those people leave. If you're the one building those tools, here's a better way to approach new projects to make sure your time is well spent. Top 3 ways IMDb’s third-party data enhances user engagement and experienceJoin this virtual event to learn the top 3 ways to increase customer engagement and improve user experience by leveraging IMDb’s third-party data. Speakers from AWS and IMDb will show you how to gain new consumer insights, deploy customer-centric features, and create personalized user experiences. Register Now CareerWorkforce Wanted - Data Talent For Social ImpactThis new report explores the rapidly growing landscape of data for social good. The report is primarily focused on ways to develop a global talent force of purpose-driven data professionals and along the way, it highlights issues and opportunities. Data VisualizationHard Data: The Erotics Of InfographicsThis is an edgy take on infographics but it sure is compelling: at the end of the day, much of the media's interest in data lies in its monetization. And it turns out, visualization and the data fetishism that the media has fostered with it has created an easy way to convert data to dollars. Was this email forwarded to you? Sign up here >> |
Older messages
Data Elixir - Issue 392
Tuesday, June 21, 2022
Graph ML intro. JTBD for data teams. How to get data out of PDFs. Research highlights from Meta. Reproducible research workflows.
Data Elixir - Issue 391
Tuesday, June 14, 2022
ML design patterns. Awesome data leadership. Faster Pandas. ML experimentation in VS Code.
Data Elixir - Issue 390
Tuesday, June 7, 2022
Friendlier SQL. Decision Intelligence framework. Collective data rights.
Data Elixir - Issue 389
Tuesday, May 31, 2022
The technical pay gap. DS: Foundations, Challenges, Opportunities. Existential threat of data quality. ML visual explainers.
Data Elixir - Issue 388
Tuesday, May 24, 2022
Software development for DS. How random forests really work. Visualizing multicollinearity. ML for conservation.
You Might Also Like
Import AI 399: 1,000 samples to make a reasoning model; DeepSeek proliferation; Apple's self-driving car simulator
Friday, February 14, 2025
What came before the golem? ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
Defining Your Paranoia Level: Navigating Change Without the Overkill
Friday, February 14, 2025
We've all been there: trying to learn something new, only to find our old habits holding us back. We discussed today how our gut feelings about solving problems can sometimes be our own worst enemy
5 ways AI can help with taxes 🪄
Friday, February 14, 2025
Remotely control an iPhone; 💸 50+ early Presidents' Day deals -- ZDNET ZDNET Tech Today - US February 10, 2025 5 ways AI can help you with your taxes (and what not to use it for) 5 ways AI can help
Recurring Automations + Secret Updates
Friday, February 14, 2025
Smarter automations, better templates, and hidden updates to explore 👀 ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
The First Provable AI-Proof Game: Introducing Butterfly Wings 4
Friday, February 14, 2025
Top Tech Content sent at Noon! Boost Your Article on HackerNoon for $159.99! Read this email in your browser How are you, @newsletterest1? undefined The Market Today #01 Instagram (Meta) 714.52 -0.32%
GCP Newsletter #437
Friday, February 14, 2025
Welcome to issue #437 February 10th, 2025 News BigQuery Cloud Marketplace Official Blog Partners BigQuery datasets now available on Google Cloud Marketplace - Google Cloud Marketplace now offers
Charted | The 1%'s Share of U.S. Wealth Over Time (1989-2024) 💰
Friday, February 14, 2025
Discover how the share of US wealth held by the top 1% has evolved from 1989 to 2024 in this infographic. View Online | Subscribe | Download Our App Download our app to see thousands of new charts from
The Great Social Media Diaspora & Tapestry is here
Friday, February 14, 2025
Apple introduces new app called 'Apple Invites', The Iconfactory launches Tapestry, beyond the traditional portfolio, and more in this week's issue of Creativerly. Creativerly The Great
Daily Coding Problem: Problem #1689 [Medium]
Friday, February 14, 2025
Daily Coding Problem Good morning! Here's your coding interview problem for today. This problem was asked by Google. Given a linked list, sort it in O(n log n) time and constant space. For example,
📧 Stop Conflating CQRS and MediatR
Friday, February 14, 2025
Stop Conflating CQRS and MediatR Read on: my website / Read time: 4 minutes The .NET Weekly is brought to you by: Step right up to the Generative AI Use Cases Repository! See how MongoDB powers your