Data Elixir - Data Elixir - Issue 366
ISSUE 366 · December 14, 2021InsightWhy is “Data Scientist” such a controversial title?There's a lot of debate about what "data science" is, exactly, and if "Data Scientists" should actually be called something else. The real problem is that "industry hasn’t learned how to take advantage of the special powers that 'scientists' bring to the table." Big Surveys and the Big Data ParadoxIncreasing the amount of data you collect may shrink confidence intervals but it will also magnify the effect of any survey bias. This paper uses recent vaccine surveys to show that happens. Ultimately, quality data is better than quantity data. Sponsored LinkHow to Get Strategic Value from Data AnalyticsRead this report to learn about the state of data analytics and the obstacles preventing its progress. Discover how broader accessibility and application of analytics in daily operations could help organizations produce more value. Get your free copy. Tutorials, Projects & OpinionsA Call to Build Models Like We Build Open-Source SoftwarePre-trained models are expensive to create and they're rarely updated, which means most of the research community gets left out of their design and creation. A lot could be gained by treating models more like open-source software. Here's how that could work. Data serialisation in RGreat post that shows how R serializes an in-memory data structure to create a sequence of bytes that can be saved or transmitted. Analyzing scraped data using Git and SQLitegit-scraping makes it easy to record snapshots of an online data source to a repository for tracking changes over time. But then what? git-history takes that repository and converts it to a SQLite database so it can be easily worked with. Combine that with something like Datasette and you can do some pretty cool stuff. How to prep a new Apple machine for data scienceGot a new Apple M1, M1 Pro or M1 Max? Here's a step-by-step guide to make sure you set it up correctly for TensorFlow and common data science libraries using Conda. How to make AI & BI work at scaleJoin leading technologists from DataRobot, Buoy Health, and AtScale for a panel on how to unlock the power of data and analytics for everyone, foster more self-service and greater data science literacy, and generate a better return on data science investments. Code & ToolsIntroducing the 🤗 Data Measurements ToolThe 🤗 Data Measurements Tool is an open-source project for building, measuring, and comparing NLP datasets -- no programming required! CogramCogram is a coding assistant for data science that's being built by a new YC startup. Cogram can generate Python code in Jupyter Notebook, and SQL from plain language, allowing data scientists to work more productively without looking up code syntax. It's available for free and they're especially looking for teams that might want customizations. ResourcesInteractive Tools for ML, DL and MathIf you like interactive playgrounds, this is a great collection of learning tools for Deep Learning, Machine Learning, Interpretability and Math. Data VisualizationPlotly ResamplerPlotly is an awesome interactive visualization library but it gets slow when there are a lot of data points. To fix that, this library downsamples the data that are in a view and then plots the downsampled points. Interview with Leland WilkinsonGreat interview with the late Leland Wilkinson where he describes highlights from his career and especially his book, The Grammar of Graphics. Leland was a legend in the data visualization community and sadly, he died at home last Friday. OutlierFinancial Modeling World Cup TournamentMove over, League of Legends. The real future of esports is spreadsheets and Microsoft Excel! To find specific content from prior issues or to research topics, check out the searchable Archives on Data Elixir's Search Page >> |
Older messages
Data Elixir - Issue 365
Tuesday, December 7, 2021
Consulting rates for DS. Tidy/Pandas visual tutors. State of Open Data. Unit testing in R. Mining the Pandora Papers.
Data Elixir - Issue 364
Tuesday, November 30, 2021
Shuffling the cloud. Data for good, responsibly. Decision tree viz. Longform NLP pipelines. Controlling the job hunt.
Data Elixir - Issue 363
Tuesday, November 23, 2021
Transformers from scratch. Open-source experiment tracker. Parameter exploration w/ Bayesian Optimization. Confidential computing.
Data Elixir - Issue 362
Tuesday, November 16, 2021
Holistic decision-making. ROI of data work. The Data Librarian. Automated root cause analysis. Scientific visualization. When ML hates veggies.
Data Elixir - Issue 361
Tuesday, November 9, 2021
Non-tech guide to interpretable ML. Avoiding data disasters. Beta regression. Measuring success. Better than box plots.
You Might Also Like
Import AI 399: 1,000 samples to make a reasoning model; DeepSeek proliferation; Apple's self-driving car simulator
Friday, February 14, 2025
What came before the golem? ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
Defining Your Paranoia Level: Navigating Change Without the Overkill
Friday, February 14, 2025
We've all been there: trying to learn something new, only to find our old habits holding us back. We discussed today how our gut feelings about solving problems can sometimes be our own worst enemy
5 ways AI can help with taxes 🪄
Friday, February 14, 2025
Remotely control an iPhone; 💸 50+ early Presidents' Day deals -- ZDNET ZDNET Tech Today - US February 10, 2025 5 ways AI can help you with your taxes (and what not to use it for) 5 ways AI can help
Recurring Automations + Secret Updates
Friday, February 14, 2025
Smarter automations, better templates, and hidden updates to explore 👀 ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
The First Provable AI-Proof Game: Introducing Butterfly Wings 4
Friday, February 14, 2025
Top Tech Content sent at Noon! Boost Your Article on HackerNoon for $159.99! Read this email in your browser How are you, @newsletterest1? undefined The Market Today #01 Instagram (Meta) 714.52 -0.32%
GCP Newsletter #437
Friday, February 14, 2025
Welcome to issue #437 February 10th, 2025 News BigQuery Cloud Marketplace Official Blog Partners BigQuery datasets now available on Google Cloud Marketplace - Google Cloud Marketplace now offers
Charted | The 1%'s Share of U.S. Wealth Over Time (1989-2024) 💰
Friday, February 14, 2025
Discover how the share of US wealth held by the top 1% has evolved from 1989 to 2024 in this infographic. View Online | Subscribe | Download Our App Download our app to see thousands of new charts from
The Great Social Media Diaspora & Tapestry is here
Friday, February 14, 2025
Apple introduces new app called 'Apple Invites', The Iconfactory launches Tapestry, beyond the traditional portfolio, and more in this week's issue of Creativerly. Creativerly The Great
Daily Coding Problem: Problem #1689 [Medium]
Friday, February 14, 2025
Daily Coding Problem Good morning! Here's your coding interview problem for today. This problem was asked by Google. Given a linked list, sort it in O(n log n) time and constant space. For example,
📧 Stop Conflating CQRS and MediatR
Friday, February 14, 2025
Stop Conflating CQRS and MediatR Read on: my website / Read time: 4 minutes The .NET Weekly is brought to you by: Step right up to the Generative AI Use Cases Repository! See how MongoDB powers your