Data Elixir - Issue 393
ISSUE 393 · June 28, 2022

Trends

The evolution of machine learning infrastructure
Budgets for data and machine learning are hitting all-time highs and lots of value is being created across the space. In this post from Bessemer Venture Partners, Bhavik Nagda and Sakib Dadi explore what's been happening in the machine learning technology stack, the key trends driving innovation, and where the opportunities are now.

Fair Game
Web scraping is widely used by industry and researchers and has been deemed to be legal by courts in the U.S. Even so, using scraped data in contexts it wasn't intended for is rife with problems. This is a great article about the complicated dance between privacy, research, and the industries that have evolved around scraped data.

Sponsored Link

How to Capture Advantages by Investing in High-Quality Training Data
At the enterprise level, machine learning requires either large amounts of training data or a smaller set of extremely high-quality data, as well as the infrastructure to support high data volumes. Consequently, labeling data through robust software or in partnership with an annotation service provider is critical to project success.

Tutorials, Projects & Opinions

Things You Should Know About Databases
It's surprisingly easy to work with databases and not really understand basic principles about how they work. In this primer, Mahdi Yusuf walks through two of the most important topics when working with relational database systems: indexes and transactions.

One year as a solo dev building open-source data tools without funding
Great, first-hand account of turning a side project into something big. Or, bigger, anyway. The reality is, going solo is complicated but, in spite of that, there's been a lot of interest in Phil Eaton's DataStation recently. This post covers the journey so far: from problem definition to software development to VC interest, and 4000 stars on GitHub!

Why I still recommend Julia
A few weeks ago, there were lengthy online discussions about bugs in the Julia ecosystem and why it shouldn't be used when correctness matters. In this counter-post, Rik Huijzer walks through the things that Julia gets right and why, even though Julia isn't perfect, he still recommends it. Includes links to key discussions.

OpenBB Terminal
OpenBB Terminal is an open-source, Python-based platform for investment research. It's inspired by Bloomberg Terminal but without the fees. It gives you access to massive amounts of real-time and historical data and, since it's Python, you can use common data science libraries with it like Pandas, NumPy, SciPy, Jupyter, PyTorch, etc.

Data Product in Changing Environments: Rethinking and Updating Investments
Datasets and tools that are built for specific people in an organization can easily be forgotten when those people leave. If you're the one building those tools, here's a better way to approach new projects to make sure your time is well spent.

Top 3 ways IMDb's third-party data enhances user engagement and experience
Join this virtual event to learn the top 3 ways to increase customer engagement and improve user experience by leveraging IMDb's third-party data. Speakers from AWS and IMDb will show you how to gain new consumer insights, deploy customer-centric features, and create personalized user experiences.

Career

Workforce Wanted - Data Talent For Social Impact
This new report explores the rapidly growing landscape of data for social good. The report is primarily focused on ways to develop a global talent force of purpose-driven data professionals and, along the way, it highlights issues and opportunities.

Data Visualization

Hard Data: The Erotics Of Infographics
This is an edgy take on infographics but it sure is compelling: at the end of the day, much of the media's interest in data lies in its monetization. And it turns out, visualization and the data fetishism that the media has fostered with it have created an easy way to convert data to dollars.