OpenAI Gets Into the Text-to-3D Game with Point-E
Was this email forwarded to you? Sign up here OpenAI Gets Into the Text-to-3D Game with Point-ESundays, The Sequence Scope brings a summary of the most important research papers, technology releases and VC funding deals in the artificial intelligence space.📝 EditorialI would like to start today’s editorial by wishing you a very blessed holiday season. 2022 has been a difficult year in tech in general but has had many excitments in AI including the explosion of innovations in generative AI which is today’s topic. The generative AI space continue pushing the boundaries of imagination in the deep learning space. Language and text-to-image models have been the areas to show the most progress but 3D is quickly surfacing as the next frontier. Generating 3D objects has resulted very challenging in deep learning given the lack of training datasets as well as the computational costs. Pretrained diffusion models removed some of these boundaries by being able to go from text to an image and then to the 3D object. Google recently unveiled some of their work in this area with the DreamFusion model and Stability AI has been making steady progress extending Stable Diffusion to 3D. Last week, OpenAI joined the race with the release of Point-E, a generative model that can produce 3D objects from language inputs. Point-E takes a very unique approach to the text-to-3D problem. Instead of generating a complete 3D object, Point-E generates a discrete set of data points that represents the 3D shape which is known as point clouds. From a computational standpoint, point clouds are way easier to synthesize. Point-E is based on two fundamental submodels: a text-to-image model based on diffusion methods and an image-to-3D model that generates the point cloud. OpenAI extended this architecture by adding the capability of coloring the point cloud resembling a complete 3D object. This area still has flaws. In addition to the research paper, OpenAI release an open source version of the model and is already included in HuggingFace. Language, images, video, 3D, the generative AI race is nothing short of fascinating. Point-E is certainly a great contribution and might be incorporated into a new version of DALL-E. 🗓 Next week in TheSequence Edge:Edge#255: Our series about ML interpretability continues by discussing the accumulated local effects(ALE) technique. The research section looks into OpenAI’s Microscope neuron visuation technnique and we discuss the IBM AI Explainability 360 stack . Edge#254: We deep dive into the architecture powering the famous ChatGPT. 🔎 ML ResearchPoint-E OpenAI published a paper detailing Point-E, a new language-to-3D generative model —> Read more. CALM Google Brain published a paper detailing confident adaptive language modeling(CALM), a method for improving the efficiency of large language models at inference time —> Read more. CoCoA-MT Amazon Science published a paper and open source dataset that improves formality control in large language models —> Read more. 🤖 Cool AI Tech ReleasesJasper Chat Generative AI startup Japer released Jasper Chat, a conversational interface to assist with the different business tasks automated in the platform —> Read more. Quora Poe Quora announced Poe, a conversational interface to interact with chatbots a la ChatGPT and receive instant answers —> Read more. New TensorFlow Models TensorFlow added new state-of-the-art quantized models to its Model Garden repository —> Read more. 🛠 Real World MLScaling ViT PyTorch discusses how to scale the vision transformer(ViT) model to 120 billion parameters. —> Read more. Auto Machine Translation at Amazon Amazon Science detailed the machine translation architecture used to translate the popular Dive Into Deep Learning textbook —> Read more. 💸 Money in AI
You’re on the free list for TheSequence Scope and TheSequence Chat. For the full experience, become a paying subscriber to TheSequence Edge. Trusted by thousands of subscribers from the leading AI labs and universities. |
Older messages
Edge 254: InstructGPT is the Model that Inspired the Famous ChatGPT
Thursday, December 22, 2022
The model fine tuned GPT-3 to improve its ability to follow instructions.
Edge 253: Interpretability Methods: Partial Dependence Plots
Tuesday, December 20, 2022
Partial dependence plots, interpretable time series forecasting and Google's fairness indicators.
Security: The Most Ignored Area of MLOps
Sunday, December 18, 2022
Sundays, The Sequence Scope brings a summary of the most important research papers, technology releases and VC funding deals in the artificial intelligence space.
Edge 252: Another Foundation Super Model: Google’s DreamFusion Can Convert Text to 3D
Friday, December 16, 2022
Another breakthrough in generative AI. DreamFusion uses diffusion models to generage 3D objects.
Edge 251: Global Model-Agnostic Interpretability
Tuesday, December 13, 2022
Global model-agnostic interpretability, student-teacher intrepetability methods and the Lucid library.
You Might Also Like
Last chance to register: SecOps made smarter
Monday, November 25, 2024
Don't miss this opportunity to learn how gen AI can transform your security workflowsㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤㅤ elastic | Search. Observe. Protect
SRE Weekly Issue #452
Monday, November 25, 2024
View on sreweekly.com A message from our sponsor, FireHydrant: Practice Makes Prepared: Why Every Minor System Hiccup Is Your Team's Secret Training Ground. https://firehydrant.com/blog/the-hidden-
Corporate Casserole 🥘
Monday, November 25, 2024
How marketing and lobbying inspired Thanksgiving traditions. Here's a version for your browser. Hunting for the end of the long tail • November 24, 2024 Hey all, Ernie here with a classic
WP Weekly 221 - Bluesky - WP Assets on CDN, Limit Font Subsets, ACF Pro Now
Monday, November 25, 2024
Read on Website WP Weekly 221 / Bluesky Have you joined Bluesky, like many other WordPress users, a new place for an online social presence? Also in this issue: CrawlWP, Asset Management Framework,
🤳🏻 We Need More High-End Small Phones — Linux Terminal Setup Tips
Sunday, November 24, 2024
Also: Why I Switched From Google Maps to Apple Maps, and More! How-To Geek Logo November 24, 2024 Did You Know Medieval moats didn't just protect castles from invaders approaching over land, but
JSK Daily for Nov 24, 2024
Sunday, November 24, 2024
JSK Daily for Nov 24, 2024 View this email in your browser A community curated daily e-mail of JavaScript news JavaScript Certification Black Friday Offer – Up to 54% Off! Certificates.dev, the trusted
OpenAI's turbulent early years - Sync #494
Sunday, November 24, 2024
Plus: Anthropic and xAI raise billions of dollars; can a fluffy robot replace a living pet; Chinese reasoning model DeepSeek R1; robot-dog runs full marathon; a $12000 surgery to change eye colour ͏ ͏
Daily Coding Problem: Problem #1618 [Easy]
Sunday, November 24, 2024
Daily Coding Problem Good morning! Here's your coding interview problem for today. This problem was asked by Zillow. Let's define a "sevenish" number to be one which is either a power
PD#602 How Netflix Built Self-Healing System to Survive Concurrency Bug
Sunday, November 24, 2024
CPUs were dying, the bug was temporarily un-fixable, and they had no viable path forward
RD#602 What are React Portals?
Sunday, November 24, 2024
A powerful feature that allows rendering components outside their parent component's DOM hierarchy