Data Elixir - Issue 438
ISSUE 438 · May 30, 2023

Favorite Data Podcasts?
If you have a favorite data podcast, cast your vote here and we'll report the top picks in an upcoming issue of the newsletter 👉

Talks & Conferences

State of GPT
In this session from last week's Microsoft Build, Andrej Karpathy describes the pipeline for training bots like ChatGPT. From there, he dives into practical techniques for using GPT effectively, including prompting techniques, finetuning, tools, and things to expect. This is a great talk, but if you're short on time, see Alex Volkov's notes 👉
Microsoft Build | Andrej Karpathy, 43 minutes

Malloy - An Experimental Language for Data
Forcing data through a rectangle shapes the way we solve problems (e.g. dimensional fact tables, OLAP cubes). But most data isn't rectangular; it's hierarchical. In this talk, Lloyd Tabb describes a new data programming language that transcends the rectangle paradigm and breaks long-held misconceptions in the way we analyze data.

Sponsored Link

Datalore. A collaborative data science platform.
Data science teams face many challenges when trying to optimize their processes and ship research results and machine learning models faster. Datalore has become a game-changing solution for data teams across industries, enabling ergonomic data access, effortless collaboration, and easy reporting via Jupyter notebooks. Try Datalore for free

Posts & Tutorials

Intro to Vega-Lite
Vega-Lite is a high-level language for rapidly creating interactive visualizations. It includes support for a variety of data and visual transformations and doesn't need a lot of code. This multi-part tutorial introduces Vega-Lite and offers a variety of step-by-step examples.

Choosing a good file format for Pandas
There are plenty of data formats supported by Pandas. Which should you choose and why?

Capturing Output to External Files
The sink() function in R is used to divert R output to an external connection. This is useful for tasks such as exporting data to a file, logging R output, or debugging code. Here's how it works.

Some Intuition on Attention and the Transformer
As ChatGPT and other LLMs get thrust into the mainstream, more people outside of ML and NLP circles are trying to better understand Attention and the Transformer. Here are some answers to common questions, with a focus on conveying the intuition.

Google Advanced Data Analytics Certificate
The Google Advanced Data Analytics Professional Certificate is a 7-course series that focuses on building regression and machine learning models, applying statistical methods to investigate data, creating data visualizations, and communicating insights from data analysis to stakeholders. The course is run by Coursera and is free to get started.

Papers

Tree of Thoughts: Problem Solving with LLMs
Language models fall short in tasks that require exploration, strategic lookahead, or where initial decisions play a pivotal role. You may have heard of "Chain of Thought" prompting to help overcome these issues. "Tree of Thoughts" works much better.

LIMA: Less Is More for Alignment
Researchers at Meta have shown that remarkably capable LLMs can be achieved with only 1,000 carefully curated examples. This could be a game-changer for researchers and small-scale developers.

Resources

Data Science Interview - Questions & Answers
Nice collection of data science interview questions and answers. There are 100+ questions here, covering machine learning, statistics, probability, Python, SQL, and more.