OpenAI Gets Into the Text-to-3D Game with Point-E
Was this email forwarded to you? Sign up here OpenAI Gets Into the Text-to-3D Game with Point-ESundays, The Sequence Scope brings a summary of the most important research papers, technology releases and VC funding deals in the artificial intelligence space.📝 EditorialI would like to start today’s editorial by wishing you a very blessed holiday season. 2022 has been a difficult year in tech in general but has had many excitments in AI including the explosion of innovations in generative AI which is today’s topic. The generative AI space continue pushing the boundaries of imagination in the deep learning space. Language and text-to-image models have been the areas to show the most progress but 3D is quickly surfacing as the next frontier. Generating 3D objects has resulted very challenging in deep learning given the lack of training datasets as well as the computational costs. Pretrained diffusion models removed some of these boundaries by being able to go from text to an image and then to the 3D object. Google recently unveiled some of their work in this area with the DreamFusion model and Stability AI has been making steady progress extending Stable Diffusion to 3D. Last week, OpenAI joined the race with the release of Point-E, a generative model that can produce 3D objects from language inputs. Point-E takes a very unique approach to the text-to-3D problem. Instead of generating a complete 3D object, Point-E generates a discrete set of data points that represents the 3D shape which is known as point clouds. From a computational standpoint, point clouds are way easier to synthesize. Point-E is based on two fundamental submodels: a text-to-image model based on diffusion methods and an image-to-3D model that generates the point cloud. OpenAI extended this architecture by adding the capability of coloring the point cloud resembling a complete 3D object. This area still has flaws. In addition to the research paper, OpenAI release an open source version of the model and is already included in HuggingFace. Language, images, video, 3D, the generative AI race is nothing short of fascinating. Point-E is certainly a great contribution and might be incorporated into a new version of DALL-E. 🗓 Next week in TheSequence Edge:Edge#255: Our series about ML interpretability continues by discussing the accumulated local effects(ALE) technique. The research section looks into OpenAI’s Microscope neuron visuation technnique and we discuss the IBM AI Explainability 360 stack . Edge#254: We deep dive into the architecture powering the famous ChatGPT. 🔎 ML ResearchPoint-E OpenAI published a paper detailing Point-E, a new language-to-3D generative model —> Read more. CALM Google Brain published a paper detailing confident adaptive language modeling(CALM), a method for improving the efficiency of large language models at inference time —> Read more. CoCoA-MT Amazon Science published a paper and open source dataset that improves formality control in large language models —> Read more. 🤖 Cool AI Tech ReleasesJasper Chat Generative AI startup Japer released Jasper Chat, a conversational interface to assist with the different business tasks automated in the platform —> Read more. Quora Poe Quora announced Poe, a conversational interface to interact with chatbots a la ChatGPT and receive instant answers —> Read more. New TensorFlow Models TensorFlow added new state-of-the-art quantized models to its Model Garden repository —> Read more. 🛠 Real World MLScaling ViT PyTorch discusses how to scale the vision transformer(ViT) model to 120 billion parameters. —> Read more. Auto Machine Translation at Amazon Amazon Science detailed the machine translation architecture used to translate the popular Dive Into Deep Learning textbook —> Read more. 💸 Money in AI
You’re on the free list for TheSequence Scope and TheSequence Chat. For the full experience, become a paying subscriber to TheSequence Edge. Trusted by thousands of subscribers from the leading AI labs and universities. |
Key phrases
Older messages
Edge 254: InstructGPT is the Model that Inspired the Famous ChatGPT
Thursday, December 22, 2022
The model fine tuned GPT-3 to improve its ability to follow instructions.
Edge 253: Interpretability Methods: Partial Dependence Plots
Tuesday, December 20, 2022
Partial dependence plots, interpretable time series forecasting and Google's fairness indicators.
Security: The Most Ignored Area of MLOps
Sunday, December 18, 2022
Sundays, The Sequence Scope brings a summary of the most important research papers, technology releases and VC funding deals in the artificial intelligence space.
Edge 252: Another Foundation Super Model: Google’s DreamFusion Can Convert Text to 3D
Friday, December 16, 2022
Another breakthrough in generative AI. DreamFusion uses diffusion models to generage 3D objects.
Edge 251: Global Model-Agnostic Interpretability
Tuesday, December 13, 2022
Global model-agnostic interpretability, student-teacher intrepetability methods and the Lucid library.
You Might Also Like
a16z’s Infrastructure team gets a new general partner
Friday, April 19, 2024
Post News is shutting down and Wall Street isn't feeling a Salesforce-Informatica pairing View this email online in your browser By Christine Hall Friday, April 19, 2024 Image Credits: Andreessen
New Roundtable! Additive for Mass Production Applications
Friday, April 19, 2024
The Outlook for the Future View this email in your browser engineering.com Roundtable - Additive for Mass Production Applications: The Outlook for the Future 6 Considerations for Choosing the Right
📷 What to Know About Macro Photography — Why You Should Buy a Budget Motherboard
Friday, April 19, 2024
Also: How to Automatically Highlight Values in Excel, and More! How-To Geek Logo April 19, 2024 📩 Get expert reviews, the hottest deals, how-to's, breaking news, and more delivered directly to your
Is the wind going out of the AI sails?
Friday, April 19, 2024
Rippling vacuums up venture capital and Ramp bags more millions View this email online in your browser By Haje Jan Kamps Friday, April 19, 2024 Image Credits: Getty Images / Carol Yepes Welcome to
Llama 3 is out - Weekly News Roundup - Issue #463
Friday, April 19, 2024
Plus: brand-new, all-electric Atlas; AI Index Report 2024; Microsoft pitched GenAI tools to US military; Humane AI Pin reviews are in; debunking Devin; and more! ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏ ͏
Daily Coding Problem: Problem #1417 [Easy]
Friday, April 19, 2024
Daily Coding Problem Good morning! Here's your coding interview problem for today. This problem was asked by Wayfair. You are given a 2 x N board, and instructed to completely cover the board with
Charted | How Hard Is It to Get Into an Ivy League School? 🎓
Friday, April 19, 2024
We detail the admission rates and average annual cost for Ivy League schools, as well as the median SAT scores required to be accepted. View Online | Subscribe Presented by: Discover the motivations
Dark Matter & Tortured Poets
Friday, April 19, 2024
New music releases aren't what they used to be -- for good and bad. Dark Matter & Tortured Poets By MG Siegler • 19 Apr 2024 View in browser View in browser New music releases in 2024 are a
Impact of AI on Product Management
Friday, April 19, 2024
Impact of AI on Product Management The rise of the AI Product Manager. Product managers have always championed customer's needs. However, with AI, the job requires new technical and ethical
⚙️ Zuck has entered the chat(bot)
Friday, April 19, 2024
Plus: AI video's coming to mobile!