Data Elixir - Data Elixir - Issue 444
ISSUE 444 · July 18, 2023Posts & TutorialsGetting started with Code InterpreterOpenAI's Code Interpreter is a general-purpose toolbox that gives GPT-4 superpowers. With Code Interpreter, GPT-4 can ingest up to 100MB of your data and then use that data in python scripts that it writes and executes. That allows GPT-4 to do all sorts of things it couldn’t do before. This post is a nice tour of what's possible now and how to use it. VScode + Docker + Python= ❤️ ❤️ ❤️This is a great guide for setting up a Python development environment with VScode and Docker. It starts with a section that explains the advantages of each tool and how they work well together. From there, it's an easy to follow, step-by-step tutorial for setting everything up. Introduction to dimensionality reductionDimensionality reduction helps to simplify complex datasets and make them more manageable to work with. In this two-part introduction, Gabe Flomo shows how dimensionality reduction works and offers practical examples for common algorithms using python. Hex | Gabe Flomo Sponsored LinkComputational linguistics in the age of large language modelsWhat are the challenges facing LLMs? Amazon senior principal scientist and ACL 2023 general chair Yang Liu highlights the problem of “hallucination”, or generating false assertions, and explains how scientists are trying to address it. Tools & CodeAn Open-source Plotting Library for Statistical DataLets-Plot is a python plotting library for statistical data. It's based on the Grammer of Graphics and largely follows the gpplot2 API. This looks like a very nice plotting library that supports a wide range of chart types and features, such as data sampling, formatting, geocoding, notebook compatibility, and much more. Introduction to theft{theft} is an R package that provides a structured analytical workflow for the extraction, analysis, and visualisation of time-series features. It's designed to be flexible and extensible and it provides standardized access to key R packages as well as python libraries. PapersRegulating Frontier AI: To Open Source or Not?Two important new papers grapple with how to govern emerging and increasingly powerful "frontier AI" models. The first paper is a collaboration between Big Tech players like OpenAI, Google, Microsoft, etc. It argues for self-regulation with government oversight. The second paper, by Jeremy Howard, counters with an open-source approach. This is a great post that thoughtfully summarizes the issues. Financial Machine LearningThis new survey paper explores the nascent literature on machine learning in financial markets. Along with an extensive review, the paper highlights the best examples of what this line of research has to offer with recommendations for future research. ResourcesAdvanced Python MasteryThis course introduces Python's more advanced features and is intended to help you understand how to control the behavior of the language and bend it in ways that serve the needs of your application. This is an exercise-driven course that's free and self-paced. R for Data Science (2e)A new, second edition of R for Data Science was recently released and it's a big update. In this edition, visualization is more thoroughly covered, the programming section has been rewritten to focus on function writing and iteration, and there's a new section for accessing data from databases, spreadsheets, and the web. Free to read online. Was this email forwarded to you? Sign up here >> |
Older messages
Data Elixir - Issue 443
Tuesday, July 11, 2023
Unraveling PCA. Hidden tools in Python. SQL inner joins. Statistical learning for python. Demystifying text data. How to do great work.
Data Elixir - Issue 442
Tuesday, June 27, 2023
Polars cookbook. LLM-powered autonomous agents. Time series with ML. Scalable & extensible viz. ML system design.
Data Elixir - Issue 441
Tuesday, June 20, 2023
Julia programming for ML. Spatial statistics. Raincloud plots. Artifact corrections for effect sizes. Perils of faking data in Excel. Private LLMs for DB interactions.
Data Elixir - Issue 440
Tuesday, June 13, 2023
NFL Analytics. Sequential testing. Data + Music. Managing generative AI risks. FinGPT: open-source LLM for finance. Data exploration toolkit.
Data Elixir - Issue 439
Monday, June 12, 2023
Data podcasts. What are embeddings? Road trip maps. Dependency management. The {marginaleffects} book. A first course in causal inference.
You Might Also Like
Import AI 399: 1,000 samples to make a reasoning model; DeepSeek proliferation; Apple's self-driving car simulator
Friday, February 14, 2025
What came before the golem? ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
Defining Your Paranoia Level: Navigating Change Without the Overkill
Friday, February 14, 2025
We've all been there: trying to learn something new, only to find our old habits holding us back. We discussed today how our gut feelings about solving problems can sometimes be our own worst enemy
5 ways AI can help with taxes 🪄
Friday, February 14, 2025
Remotely control an iPhone; 💸 50+ early Presidents' Day deals -- ZDNET ZDNET Tech Today - US February 10, 2025 5 ways AI can help you with your taxes (and what not to use it for) 5 ways AI can help
Recurring Automations + Secret Updates
Friday, February 14, 2025
Smarter automations, better templates, and hidden updates to explore 👀 ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
The First Provable AI-Proof Game: Introducing Butterfly Wings 4
Friday, February 14, 2025
Top Tech Content sent at Noon! Boost Your Article on HackerNoon for $159.99! Read this email in your browser How are you, @newsletterest1? undefined The Market Today #01 Instagram (Meta) 714.52 -0.32%
GCP Newsletter #437
Friday, February 14, 2025
Welcome to issue #437 February 10th, 2025 News BigQuery Cloud Marketplace Official Blog Partners BigQuery datasets now available on Google Cloud Marketplace - Google Cloud Marketplace now offers
Charted | The 1%'s Share of U.S. Wealth Over Time (1989-2024) 💰
Friday, February 14, 2025
Discover how the share of US wealth held by the top 1% has evolved from 1989 to 2024 in this infographic. View Online | Subscribe | Download Our App Download our app to see thousands of new charts from
The Great Social Media Diaspora & Tapestry is here
Friday, February 14, 2025
Apple introduces new app called 'Apple Invites', The Iconfactory launches Tapestry, beyond the traditional portfolio, and more in this week's issue of Creativerly. Creativerly The Great
Daily Coding Problem: Problem #1689 [Medium]
Friday, February 14, 2025
Daily Coding Problem Good morning! Here's your coding interview problem for today. This problem was asked by Google. Given a linked list, sort it in O(n log n) time and constant space. For example,
📧 Stop Conflating CQRS and MediatR
Friday, February 14, 2025
Stop Conflating CQRS and MediatR Read on: my website / Read time: 4 minutes The .NET Weekly is brought to you by: Step right up to the Generative AI Use Cases Repository! See how MongoDB powers your