Data Elixir - Issue 441
ISSUE 441 · June 20, 2023

In the News

Do Foundation Models Comply with the EU AI Act?
In this post from Stanford, researchers evaluate foundation model providers like OpenAI and Google for their compliance with the recently proposed EU regulations on AI. The post scores each of the models on the key issues and offers recommendations. Ultimately, the entire ecosystem will benefit from working towards compliance but it's not clear how, or if, that will happen.

Sponsored Link

Webinar: How to generate business intelligence leveraging Yelp's rich first-party data on AWS
Discover how to create actionable insights using Yelp's robust data sets to analyze your marketplace, your customers, and grow your business. Explore use cases on how businesses leverage this rich data with AWS Data Exchange to make strategic business decisions. Date: July 19, 2023

Posts & Tutorials

Data Falsificada (Part 1): "Clusterfake"
In this first post in a series on academic fraud, researchers explore a case where two different people independently faked data for two different studies in a paper about dishonesty! Besides the backstory, what's interesting are the techniques the researchers used to dissect Excel files. There's a lot more to those files than most people realize.

Artifact corrections for effect sizes
An effect size is a way to quantify the difference between two groups. While p-values can tell you whether an effect exists, effect sizes can tell you how large that effect is. But to be useful, effect sizes need to be corrected for a variety of statistical artifacts, such as measurement error. This post walks through nearly all artifact corrections and includes equations, code snippets and an interactive learning app.

What Makes Raincloud Plots Tick?
A raincloud plot combines visualizations of the overall shape of a distribution, the raw data values, and relevant statistics. This is a nice explainer that explores how raincloud plots are useful and things to think about for their design. The post introduces a larger project on raincloud plots that includes a paper and a notebook with examples.

5 methods to detect drift in ML embeddings
This post explores the problem of drift in ML embeddings and a variety of techniques to monitor it. For each technique, there's a description of how it works, pros/cons, and experimental results.

Tools & Code

DB-GPT - Database Interactions with Private LLMs
DB-GPT is an experimental open-source project that uses local LLMs to enable you to interact with your data in natural language. Use it to generate SQL, diagnose SQL issues, provide natural language Q/A with knowledge bases, chat with documents, etc. Privacy and security are core objectives and all of your data stays in your own environment.

GPT Engineer
GPT Engineer is an AI agent that can write an entire codebase with a prompt. Specify what you want it to build, the AI asks for clarification, and then builds it. It's made to be easy to adapt and it even learns how you want your code to look. This has been out for less than a week and already has more than 22K stars on GitHub.

Resources

Spatial Statistics for Data Science
This new book introduces the theory and practice of spatial statistics using R. Covers packages for working with spatial data, the various types of spatial data and how to access it, making maps, spatial autocorrelation, Bayesian spatial models, and more. Free to read online.

Julia programming for ML
This notebook-based course introduces Julia's machine learning ecosystem and will teach you how to write reproducible, unit-tested Julia code along the way. Prior experience with Julia is not required. Covers Julia fundamentals (e.g. plotting, data frames, classical ML), deep learning, a personal project, and finishes with debugging and profiling.

Was this email forwarded to you? Sign up here >>