Data Elixir - Data Elixir - Issue 393
ISSUE 393 · June 28, 2022TrendsThe evolution of machine learning infrastructureBudgets for data and machine learning are hitting all-time highs and lots of value is being created across the space. In this post from Bessemer Venture Partners, Bhavik Nagda and Sakib Dadi explore what’s been happening in the machine learning technology stack, the key trends driving innovation, and where the opportunities are now. Fair GameWeb scraping is widely used by industry and researchers and has been deemed to be legal by courts in the U.S. Even so, using scraped data in contexts that it wasn't intended is rife with problems. This is a great article about the complicated dance between privacy, research, and the industries that have evolved around scraped data. Sponsored LinkHow to Capture Advantages by Investing in High-Quality Training DataAt the enterprise level, machine learning requires either large amounts of training data or a smaller set of extremely high quality data, as well as the infrastructure to support high data volumes. Consequently, labeling data through robust software or in partnership with an annotation service provider is critical to project success. Read more. Tutorials, Projects & OpinionsThings You Should Know About DatabasesIt's surprisingly easy to work with databases and not really understand basic principles about how they work. In this primer, Mahdi Yusuf walks through two of the most important topics when working with relational database systems: indexes and transactions. One year as a solo dev building open-source data tools without fundingGreat, first-hand account of turning a side project into something big. Or, bigger, anyway. The reality is, going solo is complicated but, in spite of that, there's been a lot of interest in Phil Eaton's DataStation recently. This post covers the journey so far: from problem definition to software
development to VC interest, and 4000 stars on GitHub! Why I still recommend JuliaA few weeks ago, there were lengthy online discussions about bugs in the Julia ecosystem and why it shouldn't be used when correctness matters. In this counter-post, Rik Huijzer walks through the things that Julia gets right and why, even though Julia isn't perfect, he still recommends it. Includes links to key discussions. OpenBB TerminalOpenBB Terminal is an open-source, Python-based platform for investment research. It's inspired by Bloomberg Terminal but without the fees. It gives you access to massive amounts of real-time and historical data and since it's Python, you can use common data science libraries with it like Pandas, Numpy, Scipy, Jupyter, Pytorch, etc. Data Product in Changing Environments: Rethinking and Updating InvestmentsDatasets and tools that are built for specific people in an organization can easily be forgotten when those people leave. If you're the one building those tools, here's a better way to approach new projects to make sure your time is well spent. Top 3 ways IMDb’s third-party data enhances user engagement and experienceJoin this virtual event to learn the top 3 ways to increase customer engagement and improve user experience by leveraging IMDb’s third-party data. Speakers from AWS and IMDb will show you how to gain new consumer insights, deploy customer-centric features, and create personalized user experiences. Register Now CareerWorkforce Wanted - Data Talent For Social ImpactThis new report explores the rapidly growing landscape of data for social good. The report is primarily focused on ways to develop a global talent force of purpose-driven data professionals and along the way, it highlights issues and opportunities. Data VisualizationHard Data: The Erotics Of InfographicsThis is an edgy take on infographics but it sure is compelling: at the end of the day, much of the media's interest in data lies in its monetization. And it turns out, visualization and the data fetishism that the media has fostered with it has created an easy way to convert data to dollars. Was this email forwarded to you? Sign up here >> |
Older messages
Data Elixir - Issue 392
Tuesday, June 21, 2022
Graph ML intro. JTBD for data teams. How to get data out of PDFs. Research highlights from Meta. Reproducible research workflows.
Data Elixir - Issue 391
Tuesday, June 14, 2022
ML design patterns. Awesome data leadership. Faster Pandas. ML experimentation in VS Code.
Data Elixir - Issue 390
Tuesday, June 7, 2022
Friendlier SQL. Decision Intelligence framework. Collective data rights.
Data Elixir - Issue 389
Tuesday, May 31, 2022
The technical pay gap. DS: Foundations, Challenges, Opportunities. Existential threat of data quality. ML visual explainers.
Data Elixir - Issue 388
Tuesday, May 24, 2022
Software development for DS. How random forests really work. Visualizing multicollinearity. ML for conservation.
You Might Also Like
Power BI Weekly #285 - 19th November 2024
Tuesday, November 19, 2024
Power BI Weekly Newsletter Issue #285 powered by endjin Welcome to the 285th edition of Power BI Weekly! Quite a short one this week. A couple of people have written about the new Path Layer feature
Software Testing Weekly - Issue 246
Tuesday, November 19, 2024
Highlights from the 10th DORA report by Google 📈 View on the Web Archives ISSUE 246 November 19th 2024 COMMENT Welcome to the 246th issue! It's hard to believe that DORA metrics have been around
💻 Installing Linux on an Old Laptop Instead of a Raspberry Pi — Flagship Phones Need More Storage
Monday, November 18, 2024
Also: I Built the Perfect Programming Platform In Less Than 10 Minutes, and More! How-To Geek Logo November 18, 2024 Did You Know The Sixth Sense was the highest-grossing horror film of all time in
Daily Coding Problem: Problem #1612 [Hard]
Monday, November 18, 2024
Daily Coding Problem Good morning! Here's your coding interview problem for today. This problem was asked by Etsy. Given a sorted array, convert it into a height-balanced binary search tree.
10,000 ways to fail & The European Search Perspective
Monday, November 18, 2024
Reflecting on over five years of Creativerly, Signal introduces Call Links, the science of mental models, and a lot more in this week's issue of Creativerly. Creativerly 10000 ways to fail &
Charted | Global GHG Emissions, by Sector 🌎
Monday, November 18, 2024
In this graphic, we show greenhouse gas emissions by sector in 2023. View Online | Subscribe | Download Our App Presented by: New 3-Part Series: Bitcoin Demystified >> Learn more about one of the
Spyglass Dispatch: Samsung/Google Smart Glasses • Star Wars Mess • Netflix Knocked Out • Conan's Oscars • MicroStrategy's Comeback • Vision Pro In Focus • Saving 'Inside the NBA' • Apple Television Lives!
Monday, November 18, 2024
Samsung/Google Smart Glasses • Star Wars Mess • Netflix Knocked Out • Conan's Oscars • MicroStrategy's Comeback • Vision Pro In Focus • Saving 'Inside the NBA' • Apple Television Lives!
GCP Newsletter #424
Monday, November 18, 2024
Welcome to issue #425 November 18th, 2024 News Google Kubernetes Engine Official Blog 65000 nodes and counting: Google Kubernetes Engine is ready for trillion-parameter AI models - Google Kubernetes
Design and code beautiful products. Together.
Monday, November 18, 2024
Pablo Ruiz-Múzquiz and the team at Penpot have recently announced a new plugin feature that allows users to build new tools and functionalities on the platform. ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
Can Bitcoin Put an End to Forever War?
Monday, November 18, 2024
Top Tech Content sent at Noon! How the world collects web data Read this email in your browser How are you, @newsletterest1? 🪐 What's happening in tech today, November 18, 2024? The HackerNoon