Close Menu
    DevStackTipsDevStackTips
    • Home
    • News & Updates
      1. Tech & Work
      2. View All

      Sunshine And March Vibes (2025 Wallpapers Edition)

      May 11, 2025

      The Case For Minimal WordPress Setups: A Contrarian View On Theme Frameworks

      May 11, 2025

      How To Fix Largest Contentful Paint Issues With Subpart Analysis

      May 11, 2025

      How To Prevent WordPress SQL Injection Attacks

      May 11, 2025

      Triple 4K 144Hz displays from something this small blows my mind — CalDigit’s Element 5 Hub tested and reviewed

      May 11, 2025

      Ori and the Blind Forest studio head says they could be forced to “shut down” due to the impact of negative Steam reviews for ‘No Rest for the Wicked’

      May 11, 2025

      Windows Phone just got its first AI ChatGPT-style app. No, really.

      May 11, 2025

      New Xbox games launching this week, from May 12 through May 18 — DOOM: The Dark Ages arrives on Xbox Game Pass

      May 11, 2025
    • Development
      1. Algorithms & Data Structures
      2. Artificial Intelligence
      3. Back-End Development
      4. Databases
      5. Front-End Development
      6. Libraries & Frameworks
      7. Machine Learning
      8. Security
      9. Software Engineering
      10. Tools & IDEs
      11. Web Design
      12. Web Development
      13. Web Security
      14. Programming Languages
        • PHP
        • JavaScript
      Featured

      Brisa v0.2.13

      May 11, 2025
      Recent

      Brisa v0.2.13

      May 11, 2025

      Build Digital Assets & Earn Through Referrals with Biela — A Genuine Opportunity for Entrepreneurs

      May 11, 2025

      Laravel Routing

      May 10, 2025
    • Operating Systems
      1. Windows
      2. Linux
      3. macOS
      Featured

      Triple 4K 144Hz displays from something this small blows my mind — CalDigit’s Element 5 Hub tested and reviewed

      May 11, 2025
      Recent

      Triple 4K 144Hz displays from something this small blows my mind — CalDigit’s Element 5 Hub tested and reviewed

      May 11, 2025

      Ori and the Blind Forest studio head says they could be forced to “shut down” due to the impact of negative Steam reviews for ‘No Rest for the Wicked’

      May 11, 2025

      Windows Phone just got its first AI ChatGPT-style app. No, really.

      May 11, 2025
    • Learning Resources
      • Books
      • Cheatsheets
      • Tutorials & Guides
    Home»Development»Machine Learning»Tencent Released PrimitiveAnything: A New AI Framework That Reconstructs 3D Shapes Using Auto-Regressive Primitive Generation

    Tencent Released PrimitiveAnything: A New AI Framework That Reconstructs 3D Shapes Using Auto-Regressive Primitive Generation

    May 11, 2025

    Shape primitive abstraction, which breaks down complex 3D forms into simple, interpretable geometric units, is fundamental to human visual perception and has important implications for computer vision and graphics. While recent methods in 3D generation—using representations like meshes, point clouds, and neural fields—have enabled high-fidelity content creation, they often lack the semantic depth and interpretability needed for tasks such as robotic manipulation or scene understanding. Traditionally, primitive abstraction has been tackled using either optimization-based methods, which fit geometric primitives to shapes but often over-segment them semantically, or learning-based methods, which train on small, category-specific datasets and thus lack generalization. Early approaches used basic primitives like cuboids and cylinders, later evolving to more expressive forms like superquadrics. However, a major challenge persists in designing methods that can abstract shapes in a way that aligns with human cognition while also generalizing across diverse object categories.

    Inspired by recent breakthroughs in 3D content generation using large datasets and auto-regressive transformers, the authors propose reframing shape abstraction as a generative task. Rather than relying on geometric fitting or direct parameter regression, their approach sequentially constructs primitive assemblies to mirror human reasoning. This design more effectively captures both semantic structure and geometric accuracy. Prior works in auto-regressive modeling—such as MeshGPT and MeshAnything—have shown strong results in mesh generation by treating 3D shapes as sequences, incorporating innovations like compact tokenization and shape conditioning. 

    PrimitiveAnything is a framework developed by researchers from Tencent AIPD and Tsinghua University that redefines shape abstraction as a primitive assembly generation task. It introduces a decoder-only transformer conditioned on shape features to generate sequences of variable-length primitives. The framework employs a unified, ambiguity-free parameterization scheme that supports multiple primitive types while maintaining high geometric accuracy and learning efficiency. By learning directly from human-designed shape abstractions, PrimitiveAnything effectively captures how complex shapes are broken into simpler components. Its modular design supports easy integration of new primitive types, and experiments show it produces high-quality, perceptually aligned abstractions across diverse 3D shapes. 

    PrimitiveAnything is a framework that models 3D shape abstraction as a sequential generation task. It uses a discrete, ambiguity-free parameterization to represent each primitive’s type, translation, rotation, and scale. These are encoded and fed into a transformer, which predicts the next primitive based on prior ones and shape features extracted from point clouds. A cascaded decoder models dependencies between attributes, ensuring coherent generation. Training combines cross-entropy losses, Chamfer Distance for reconstruction accuracy, and Gumbel-Softmax for differentiable sampling. The process continues autoregressively until an end-of-sequence token signals completion, enabling flexible and human-like decomposition of complex 3D shapes. 

    The researchers introduce a large-scale HumanPrim dataset comprising 120K 3D samples with manually annotated primitive assemblies. Their method is evaluated using metrics like Chamfer Distance, Earth Mover’s Distance, Hausdorff Distance, Voxel-IoU, and segmentation scores (RI, VOI, SC). Compared to existing optimization- and learning-based methods, it shows superior performance and better alignment with human abstraction patterns. Ablation studies confirm the importance of each design component. Additionally, the framework supports 3D content generation from text or image inputs. It offers user-friendly editing, high modeling quality, and over 95% storage saving, making it well-suited for efficient and interactive 3D applications. 

    In conclusion, PrimitiveAnything is a new framework that approaches 3D shape abstraction as a sequence generation task. By learning from human-designed primitive assemblies, the model effectively captures intuitive decomposition patterns. It achieves high-quality results across various object categories, highlighting its strong generalization ability. The method also supports flexible 3D content creation using primitive-based representations. Due to its efficiency and lightweight structure, PrimitiveAnything is well-suited for enabling user-generated content in applications such as gaming, where both performance and ease of manipulation are essential. 


    Check out Paper, Demo and GitHub Page. All credit for this research goes to the researchers of this project. Also, feel free to follow us on Twitter and don’t forget to join our 90k+ ML SubReddit.

    Here’s a brief overview of what we’re building at Marktechpost:

    • ML News Community – r/machinelearningnews (92k+ members)
    • Newsletter– airesearchinsights.com/(30k+ subscribers)
    • miniCON AI Events – minicon.marktechpost.com
    • AI Reports & Magazines – magazine.marktechpost.com
    • AI Dev & Research News – marktechpost.com (1M+ monthly readers)
    • Partner with us

    The post Tencent Released PrimitiveAnything: A New AI Framework That Reconstructs 3D Shapes Using Auto-Regressive Primitive Generation appeared first on MarkTechPost.

    Source: Read More 

    Facebook Twitter Reddit Email Copy Link
    Previous ArticleA Coding Implementation of Accelerating Active Learning Annotation with Adala and Google Gemini
    Next Article ThumbnailPilot

    Related Posts

    Machine Learning

    How to Evaluate Jailbreak Methods: A Case Study with the StrongREJECT Benchmark

    May 11, 2025
    Machine Learning

    LightOn AI Released GTE-ModernColBERT-v1: A Scalable Token-Level Semantic Search Model for Long-Document Retrieval and Benchmark-Leading Performance

    May 11, 2025
    Leave A Reply Cancel Reply

    Continue Reading

    A regular expression refresher

    Development

    Xbox FY25 Q3 gaming revenue up 5% year-over-year, driven by growth in Call of Duty, Minecraft, and Xbox Game Pass

    News & Updates

    CVE-2025-2605 – Honeywell MB-Secure OS Command Injection Vulnerability

    Common Vulnerabilities and Exposures (CVEs)

    How we improved availability through iterative simplification

    Development

    Highlights

    Google to Simplify Chrome browser Security Settings Interface for Better User Experience

    April 24, 2025

    Security is paramount for the web browser, as it protects the websites you visit, the…

    There’s a secret smart home radio in these Macs and iPads – does yours have one?

    June 6, 2024

    What Are Scripts and How Do They Work? Improve Your Productivity with Scripting

    January 27, 2025

    Llama 2 to Llama 3: Meta’s Leap in Open-Source Language Models

    May 27, 2024
    © DevStackTips 2025. All rights reserved.
    • Contact
    • Privacy Policy

    Type above and press Enter to search. Press Esc to cancel.