Python Weekly - Python Weekly - Issue 666

View this email in your browser

Python Weekly

Welcome to issue 666 of Python Weekly. Let's get straight to the links this week.
From Our Sponsor 
Get Your Weekly Dose of Programming
A weekly newsletter featuring the best hand curated news, articles, tutorials, talks, tools and libraries etc for programmers. Join For Free

Articles, Tutorials and Talks

Building LLMs from the Ground Up 
This tutorial guides coders through the fundamentals of large language models (LLMs), explaining how they work and how to build them from scratch in PyTorch. It covers coding a small GPT-like model, its data pipeline, architecture, pretraining, and fine-tuning using open-source libraries.

Classifying all of the pdfs on the internet
The article describes an attempt to classify a massive dataset of 8.4 million PDFs from Common Crawl using various machine learning techniques. The author experiments with different approaches, including deep learning models and traditional machine learning methods like XGBoost, ultimately achieving the best performance with an XGBoost model trained on embeddings, reaching 85.26% accuracy after hyperparameter tuning.

Why I Still Use Python Virtual Environments in Docker
The article argues for using Python virtual environments in Docker containers, citing benefits like predictability, standardization, and easier debugging. The author contends that virtual environments provide a consistent, well-understood structure for Python applications, making communication and deployment across teams more straightforward, while also simplifying Python's import behavior.

My Favorite Error Handling Technique 
This video presents a surprising “Let it burn” approach to error handling, demonstrating how allowing code to fail fast can result in simpler, clearer, and more robust software. Discover the benefits of this method and its impact on improving overall code quality.

How we made querying Pandas DataFrames with chDB 87x faster
We just released chDB version 2.0, which lets you query Pandas DataFrames 87x faster than 1.0. In this blog post we'll explain how we did it.

Integrating Stripe Into A One-Product Django Python Shop
In the first part of this series, we created a Django online shop with htmx. In this second part, we'll handle orders using Stripe.

Shades of testing HTTP requests in Python
The post discusses various approaches to testing HTTP requests in Python applications, focusing on mocking external API calls during unit and integration testing.

Pure Python: Build a full stack ChatGPT-like UI. Reflex, Neon Postgres. Deploy with Docker to a VM 
This video tutorial demonstrates how to build a full-stack ChatGPT-like UI using Reflex, a Python framework for web development, integrating it with Neon Postgres database and OpenAI. It covers the entire process from setting up the development environment to deploying the application using Docker, GitHub Actions, and Ansible on a virtual machine.

Escaping from Anaconda's Stranglehold on macOS
The article provides a simple solution for macOS users to escape Anaconda's control over their Python environment by moving the .zshrc file out of the home directory. It offers step-by-step instructions for non-technical users to toggle between official Python and Anaconda versions without using command-line interfaces or editing files.

Taming the beast that is the Django ORM - An introduction
The Django ORM, how it compares to raw SQL and gotchas that you should be aware of when using it

FastAPI Logging in Google App Engine is Not a Nightmare Anymore with fastapi-gae-logging
Transform scattered logs into actionable insights with seamless Google Cloud integration for FastAPI apps.

Using GPT-4o for web scraping
The article discusses using GPT-4 with OpenAI's structured outputs feature to create an AI-assisted web scraper, exploring its capabilities in parsing complex tables and generating XPaths. While the author found GPT-4 effective at extracting data from various HTML tables, they also noted challenges with merged rows, high API costs, and the need for further refinements to improve accuracy and efficiency.

How to Create a Pre-Commit Hook
A step-by-step guide to developing your own pre-commit hook.

Multimodal Data Analysis with LLMs and Python – Tutorial
The tutorial teaches how to analyze multimodal data using Large Language Models (LLMs) and Python, covering text classification, image-based question answering, audio transcription, and creating a natural language query interface for SQL databases.


Interesting Projects, Tools and Libraries

Mini-Omni
Mini-Omni is an open-source multimodel large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.

kazam
Linux Screen Recorder, Broadcaster, Capture and OCR with AI in mind.

pipefunc
Lightweight function pipeline (DAG) creation in pure Python for scientific workflows.

supertree
supertree is a Python package designed to visualize decision trees in an interactive and user-friendly way within Jupyter Notebooks, Jupyter Lab, Google Colab, and any other notebooks that support HTML rendering. 

smartcut
Cut video files with minimal recoding.

kotaemon
An open-source RAG-based tool for chatting with your documents.

uvtrick
A fun party trick to run Python code from another venv into this one.

nlp-zero-to-hero
A comprehensive resource for learning Natural Language Processing (NLP) from the basics to advanced topics. It contains Jupyter notebooks covering various NLP concepts, techniques, and implementations, making it a valuable guide for beginners and intermediate learners in the field of NLP.

cookiecutter-uv
A modern cookiecutter template for Python projects that use uv for dependency management.


New Releases

Django security releases issued: 5.1.1, 5.0.9, and 4.2.16


Upcoming Events and Webinars

Django London Meetup September 2024
There will be following talks
  • Empowering Django with Background Workers
  • Pydantic Logfire — Uncomplicated Observability

Virtual: PyMNtos Python Presentation Night #128
There will be following talks
  • Job search automation with data scraping and machine learning
  • Using AutoGluon (AutoML) for Image Classification or Semantic Segmentation

PyData Johannesburg Meetup September 2024
There will be following talks
  • Not your typical RAG application
  • Building a Training Course Outline with Azure OpenAI
  • Enhancing Retrieval Augmented Generation with GraphRAG

PyData Eindhoven Meetup September 2024
There will be following talks
  • Embedding Software Engineering Best Practices into Machine Learning Projects with Kedro
  • Optimizing Rail Traffic Control using a Digital Twin and Reinforcement Learning

PyData Munich Meetup September 2024
There will be following talks
  • Aligning Signals: Key Learnings in LLMOps for Faster, Confident Development
  • Reusable AI: Customizing LLMs for Diverse Business Needs

PyData Lisbon Meetup September 2024
There will be following talks
  • Aequitas Flow: A Fair ML optimization framework
  • Llama to Llama 3.1 -- a year and a half of open-access LLMs in retrospective
Our Other Newsletters
Programmer Weekly - A free weekly newsletter for programmers.

Founder Weekly - A free weekly newsletter for entrepreneurs featuring best curated content, must read articles, how to guides, tips and tricks, resources, events and more.
 
Copyright © 2024 Python Weekly, All rights reserved.
You are receiving our weekly newsletter because you signed up at http://www.PythonWeekly.com.

Our mailing address is:
Python Weekly
Brooklyn
Brooklyn, NY 11209

Add us to your address book


Want to change how you receive these emails?
You can update your preferences or unsubscribe from this list.

Older messages

Python Weekly - Issue 665

Thursday, August 29, 2024

View this email in your browser Python Weekly Welcome to issue 665 of Python Weekly. Let's get straight to the links this week. Articles, Tutorials and Talks Quick and Easy Rate Limiting for

Python Weekly - Issue 664

Thursday, August 22, 2024

View this email in your browser Python Weekly Welcome to issue 664 of Python Weekly. Let's get straight to the links this week. From Our Sponsor Get Your Weekly Dose of Programming A weekly

Python Weekly - Issue 663

Tuesday, August 20, 2024

View this email in your browser Python Weekly Welcome to issue 663 of Python Weekly. Let's get straight to the links this week. Articles, Tutorials and Talks From Boring Object-Oriented to INSANE

Python Weekly - Issue 662

Thursday, August 8, 2024

View this email in your browser Python Weekly Welcome to issue 662 of Python Weekly. Let's get straight to the links this week. Articles, Tutorials and Talks FINALLY Python is Getting Rid of the

Python Weekly - Issue 661

Thursday, August 1, 2024

View this email in your browser Python Weekly Welcome to issue 661 of Python Weekly. Let's get straight to the links this week. From Our Sponsor Get Your Weekly Dose of Programming A weekly

You Might Also Like

Charted | Global Economic Confidence in 2025, by Country 🌎

Wednesday, December 25, 2024

While emerging markets in Asia have the strongest confidence in the global economy looking ahead, European countries are most pessimistic. View Online | Subscribe | Download Our App FEATURED STORY

Top Tech Deals 🎅 Sony Headphones, iPhone Cases, 4K Projector, and More!

Wednesday, December 25, 2024

The season of giving is upon us. How-To Geek Logo December 25, 2024 Top Tech Deals: Sony Headphones, iPhone Cases, 4K Projector, and More! The season of giving is upon us. Happy Holidays! If you're

Why the Race to AGI is Humanitys Defining Moment

Wednesday, December 25, 2024

Top Tech Content sent at Noon! Boost Your Article on HackerNoon for $159.99! Read this email in your browser How are you, @newsletterest1? 🪐 What's happening in tech today, December 25, 2024? The

Iran's Charming Kitten Deploys BellaCPP: A New C++ Variant of BellaCiao Malware

Wednesday, December 25, 2024

THN Daily Updates Newsletter cover The Data Science Handbook, 2nd Edition ($60.00 Value) FREE for a Limited Time Practical, accessible guide to becoming a data scientist, updated to include the latest

Software Testing Weekly - Issue 251

Wednesday, December 25, 2024

GitHub Copilot is free! 🤖 View on the Web Archives ISSUE 251 December 25th 2024 COMMENT Welcome to the 251st issue! In case you missed it — GitHub Copilot is free! The free version works with Visual

Daily Coding Problem: Problem #1647 [Medium]

Tuesday, December 24, 2024

Daily Coding Problem Good morning! Here's your coding interview problem for today. This problem was asked by Square. In front of you is a row of N coins, with values v 1 , v 1 , ..., v n . You are

Sentiment Analysis, Topological Sort, Web Security, and More

Tuesday, December 24, 2024

Exploring Modern Sentiment Analysis Approaches in Python #661 – DECEMBER 24, 2024 VIEW IN BROWSER The PyCoder's Weekly Logo Exploring Modern Sentiment Analysis Approaches in Python What are the

🤫 Do Not Disturb Mode Is My Secret to Sanity — 8 Gadgets I Want To See Nintendo Make

Tuesday, December 24, 2024

Also: The Best Christmas Movies to Watch on Netflix, and More! How-To Geek Logo December 24, 2024 Did You Know Their association with the Christmas season might make you think poinsettias hail from a

😱 AzureEdge.net DNS Retiring Jan. 2025, 🚀 Microsoft Phi-4 AI Outperforms, 🔒 Microsoft Secure Future Initiative

Tuesday, December 24, 2024

Blog | Advertise | View Online Your trusted source for Cloud, AI and DevOps guidance with industry expert Chris Pietschmann! Phi-4: Microsoft's New Small Language Model Outperforms Giants in AI

Mapped | The Top Health Insurance Companies by State 🏥

Tuesday, December 24, 2024

In 13 US states, a single company dominates the health insurance market, holding at least half of the total market share. View Online | Subscribe | Download Our App Presented by: Global X ETFs Power