Data Elixir - Data Elixir - Issue 438
ISSUE 438 · May 30, 2023Favorite Data Podcasts?If you have a favorite data podcast, cast your vote here and we'll report the top picks in an upcoming issue of the newsletter 👉 Talks & ConferencesState of GPTIn this session from last week's Microsoft Build, Andrej Karpathy describes the pipeline for training bots like ChatGPT. From there, he dives into into practical techniques for using GPT effectively, including prompting techniques, finetuning, tools, and things to expect. This is a great talk but if you're short on time, see Alex Volkov's notes 👉Microsoft Build | Andrej Karpathy — 43 minutes Malloy - An Experimental Language for DataForcing data through a rectangle shapes the way we solve problems (e.g. dimensional fact tables, OLAP Cubes). But most data isn't rectangular — it's hierarchical. In this talk, Lloyd Tabb describes a new data programming language that transcends the rectangle paradigm and breaks long held misconceptions in the way we analyze data. Sponsored LinkDatalore. A collaborative data science platform.Data science teams face many challenges when trying to optimize their processes and ship research results and machine learning models faster. Datalore has become a game-changing solution for data teams across industries, enabling ergonomic data access, effortless collaboration, and easy reporting via Jupyter notebooks. Try Datalore for free Posts & Tutorials![]() Intro to Vega-LiteVega-Lite is a high-level language for rapidly creating interactive visualizations. It includes support for a variety of data and visual transformations and doesn't need a lot of code. This multi-part tutorial introduces Vega-Lite and offers a variety of step-by-step examples. Choosing a good file format for PandasThere are plenty of data formats supported by Pandas. Which should you choose and why? Capturing Output to External FilesThe sink() function in R is used to divert R output to an external connection. This can be useful for a variety of uses, such as exporting data to a file, logging R output, or debugging code. Here's how it works. Some Intuition on Attention and the TransformerAs ChatGPT and other LLMs get thrust into the mainstream, more people outside of ML and NLP circles are trying to better understand Attention and the Transformer. Here are some answers to common questions, with a focus on conveying the intuition. Google Advanced Data Analytics CertificateThe Google Advanced Data Analytics Professional Certificate is a 7-course series that focuses on building regression and machine learning models, applying statistical methods to investigate data, creating data visualizations, and communicating insights from data analysis to stakeholders. The course is run by Coursera and is free to get started. PapersTree of Thoughts: Problem Solving with LLMsLanguage models fall short in tasks that require exploration, strategic lookahead, or where initial decisions play a pivotal role. You may have heard of "Chain of Thought" prompting to help overcome these issues. "Tree of Thought" works much better. LIMA: Less Is More for AlignmentResearchers at Meta have shown that remarkably capable LLMs can be achieved with only 1,000 carefully curated examples. This could be a game-changer for researchers and small-scale developers. ResourcesData Science Interview - Questions & AnswersNice collection of data science interview questions and answers. There are 100+ questions here, covering machine learning, statistics, probability, python, SQL, and more. Was this email forwarded to you? Sign up here >> |
Older messages
Data Elixir - Issue 437
Tuesday, May 23, 2023
How db indexes work. ML vs climate change. Word salad. Guide to MLOps. Intro to data viz for the web.
Data Elixir - Issue 436
Tuesday, May 16, 2023
privateGPT. Julia 1.9 highlights. Built on probability. Tidy Finance. Python packaging.
Data Elixir - Issue 435
Tuesday, May 9, 2023
Demand forecasting. Cookbook for Self-Supervised Learning. Mojo, a hot new programming language. Causal inference for data analysis. Optical illusions in viz.
Issue 434
Tuesday, May 2, 2023
p values for A/B tests? Synthetic data. Understanding LLMs. Awesome ggplot2 🕶️.
Data Elixir - Issue 433
Tuesday, April 25, 2023
ML design patterns. Analysis with SQLite and Python. A/B testing resources. Performant tidy code. How to run surveys.
You Might Also Like
Import AI 399: 1,000 samples to make a reasoning model; DeepSeek proliferation; Apple's self-driving car simulator
Friday, February 14, 2025
What came before the golem? ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
Defining Your Paranoia Level: Navigating Change Without the Overkill
Friday, February 14, 2025
We've all been there: trying to learn something new, only to find our old habits holding us back. We discussed today how our gut feelings about solving problems can sometimes be our own worst enemy
5 ways AI can help with taxes 🪄
Friday, February 14, 2025
Remotely control an iPhone; 💸 50+ early Presidents' Day deals -- ZDNET ZDNET Tech Today - US February 10, 2025 5 ways AI can help you with your taxes (and what not to use it for) 5 ways AI can help
Recurring Automations + Secret Updates
Friday, February 14, 2025
Smarter automations, better templates, and hidden updates to explore 👀 ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
The First Provable AI-Proof Game: Introducing Butterfly Wings 4
Friday, February 14, 2025
Top Tech Content sent at Noon! Boost Your Article on HackerNoon for $159.99! Read this email in your browser How are you, @newsletterest1? undefined The Market Today #01 Instagram (Meta) 714.52 -0.32%
GCP Newsletter #437
Friday, February 14, 2025
Welcome to issue #437 February 10th, 2025 News BigQuery Cloud Marketplace Official Blog Partners BigQuery datasets now available on Google Cloud Marketplace - Google Cloud Marketplace now offers
Charted | The 1%'s Share of U.S. Wealth Over Time (1989-2024) 💰
Friday, February 14, 2025
Discover how the share of US wealth held by the top 1% has evolved from 1989 to 2024 in this infographic. View Online | Subscribe | Download Our App Download our app to see thousands of new charts from
The Great Social Media Diaspora & Tapestry is here
Friday, February 14, 2025
Apple introduces new app called 'Apple Invites', The Iconfactory launches Tapestry, beyond the traditional portfolio, and more in this week's issue of Creativerly. Creativerly The Great
Daily Coding Problem: Problem #1689 [Medium]
Friday, February 14, 2025
Daily Coding Problem Good morning! Here's your coding interview problem for today. This problem was asked by Google. Given a linked list, sort it in O(n log n) time and constant space. For example,
📧 Stop Conflating CQRS and MediatR
Friday, February 14, 2025
Stop Conflating CQRS and MediatR Read on: my website / Read time: 4 minutes The .NET Weekly is brought to you by: Step right up to the Generative AI Use Cases Repository! See how MongoDB powers your