Data Elixir - Data Elixir - Issue 438
ISSUE 438 · May 30, 2023Favorite Data Podcasts?If you have a favorite data podcast, cast your vote here and we'll report the top picks in an upcoming issue of the newsletter 👉 Talks & ConferencesState of GPTIn this session from last week's Microsoft Build, Andrej Karpathy describes the pipeline for training bots like ChatGPT. From there, he dives into into practical techniques for using GPT effectively, including prompting techniques, finetuning, tools, and things to expect. This is a great talk but if you're short on time, see Alex Volkov's notes 👉Microsoft Build | Andrej Karpathy — 43 minutes Malloy - An Experimental Language for DataForcing data through a rectangle shapes the way we solve problems (e.g. dimensional fact tables, OLAP Cubes). But most data isn't rectangular — it's hierarchical. In this talk, Lloyd Tabb describes a new data programming language that transcends the rectangle paradigm and breaks long held misconceptions in the way we analyze data. Sponsored LinkDatalore. A collaborative data science platform.Data science teams face many challenges when trying to optimize their processes and ship research results and machine learning models faster. Datalore has become a game-changing solution for data teams across industries, enabling ergonomic data access, effortless collaboration, and easy reporting via Jupyter notebooks. Try Datalore for free Posts & TutorialsIntro to Vega-LiteVega-Lite is a high-level language for rapidly creating interactive visualizations. It includes support for a variety of data and visual transformations and doesn't need a lot of code. This multi-part tutorial introduces Vega-Lite and offers a variety of step-by-step examples. Choosing a good file format for PandasThere are plenty of data formats supported by Pandas. Which should you choose and why? Capturing Output to External FilesThe sink() function in R is used to divert R output to an external connection. This can be useful for a variety of uses, such as exporting data to a file, logging R output, or debugging code. Here's how it works. Some Intuition on Attention and the TransformerAs ChatGPT and other LLMs get thrust into the mainstream, more people outside of ML and NLP circles are trying to better understand Attention and the Transformer. Here are some answers to common questions, with a focus on conveying the intuition. Google Advanced Data Analytics CertificateThe Google Advanced Data Analytics Professional Certificate is a 7-course series that focuses on building regression and machine learning models, applying statistical methods to investigate data, creating data visualizations, and communicating insights from data analysis to stakeholders. The course is run by Coursera and is free to get started. PapersTree of Thoughts: Problem Solving with LLMsLanguage models fall short in tasks that require exploration, strategic lookahead, or where initial decisions play a pivotal role. You may have heard of "Chain of Thought" prompting to help overcome these issues. "Tree of Thought" works much better. LIMA: Less Is More for AlignmentResearchers at Meta have shown that remarkably capable LLMs can be achieved with only 1,000 carefully curated examples. This could be a game-changer for researchers and small-scale developers. ResourcesData Science Interview - Questions & AnswersNice collection of data science interview questions and answers. There are 100+ questions here, covering machine learning, statistics, probability, python, SQL, and more. Was this email forwarded to you? Sign up here >> |
Older messages
Data Elixir - Issue 437
Tuesday, May 23, 2023
How db indexes work. ML vs climate change. Word salad. Guide to MLOps. Intro to data viz for the web.
Data Elixir - Issue 436
Tuesday, May 16, 2023
privateGPT. Julia 1.9 highlights. Built on probability. Tidy Finance. Python packaging.
Data Elixir - Issue 435
Tuesday, May 9, 2023
Demand forecasting. Cookbook for Self-Supervised Learning. Mojo, a hot new programming language. Causal inference for data analysis. Optical illusions in viz.
Issue 434
Tuesday, May 2, 2023
p values for A/B tests? Synthetic data. Understanding LLMs. Awesome ggplot2 🕶️.
Data Elixir - Issue 433
Tuesday, April 25, 2023
ML design patterns. Analysis with SQLite and Python. A/B testing resources. Performant tidy code. How to run surveys.
You Might Also Like
GCP Newsletter #424
Monday, November 18, 2024
Welcome to issue #425 November 18th, 2024 News Google Kubernetes Engine Official Blog 65000 nodes and counting: Google Kubernetes Engine is ready for trillion-parameter AI models - Google Kubernetes
Design and code beautiful products. Together.
Monday, November 18, 2024
Pablo Ruiz-Múzquiz and the team at Penpot have recently announced a new plugin feature that allows users to build new tools and functionalities on the platform. ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
Can Bitcoin Put an End to Forever War?
Monday, November 18, 2024
Top Tech Content sent at Noon! How the world collects web data Read this email in your browser How are you, @newsletterest1? 🪐 What's happening in tech today, November 18, 2024? The HackerNoon
25 tips for programming with AI
Monday, November 18, 2024
Meta Quest dominates Steam VR; Stop squirting hot glue into devices -- ZDNET ZDNET Tech Today - US November 18, 2024 digitalspeed-gettyimages-1322205545 25 AI tips to boost your programming
Ordering, Grouping and Consistency in Messaging systems
Monday, November 18, 2024
We went quite far from our Queue Broker series in recent editions, but today, we're back to it! By powers combined, I joined our Queue Broker implementation to solve the generic idempotency check
⚡ THN Recap: Top Cybersecurity Threats, Tools, and Practices (Nov 11 - Nov 17)
Monday, November 18, 2024
Ready to outsmart the hackers? 👇 Dive into this week's must-know updates.
Import AI 392: China releases another excellent coding model; generative models and robots; scaling laws for agent…
Monday, November 18, 2024
If aliens built AI, would it also use stochastic gradient descent? ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
⚙️ Musk's $6 billion
Monday, November 18, 2024
Plus: We chat with an AI venture capitalist
Post from Syncfusion Blogs on 11/18/2024
Monday, November 18, 2024
New blogs from Syncfusion React vs. Next.js: Choosing the Right Framework By Prashant Yadav Learn the key differences between React and Next.js to choose the right framework for your web development
Gmail's New Shielded Email Feature Lets Users Create Aliases for Email Privacy
Monday, November 18, 2024
THN Daily Updates Newsletter cover [Watch LIVE] When Shift Happens: Are You Ready for Rapid Certificate Replacement? Revocations can disrupt your business, but automation saves the day. Discover how.