Merging design and computer science in creative ways

· Source: MIT News - Natural language processing · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Robotics & Autonomous Systems, Emerging Technologies & Innovation · Depth: Advanced, medium

Summary

MIT MAD Fellow Alexander Htet Kyaw, a dual master's student in architectural studies and electrical engineering, integrates AI, augmented reality (AR), and robotics with gesture and speech recognition to create human-AI workflows. His projects include Curator AI, an AR-powered platform that won $26,000 in OpenAI products and cash at an MIT hackathon, designed to streamline online furniture shopping by providing context-aware recommendations based on room dimensions and user prompts. Another project, Estimate, uses AR and object-recognition AI to generate detailed cost estimates for renovations and paint jobs for small businesses, earning $5,000. Kyaw also developed Unlog, an AR and gesture recognition system for mapping building component dimensions to construct sustainable structures from whole logs, and BendShelters, which uses physics simulation, gesture recognition, and AR to design reconfigurable bamboo shelters for displaced persons in Myanmar.

Key takeaway

For AI Scientists and Computer Vision Engineers developing human-computer interaction systems, consider integrating AR and gesture/speech recognition to create more intuitive and context-aware applications. Your focus on natural interaction can significantly enhance user experience in fields like retail, construction, and sustainable design, potentially leading to more impactful and user-friendly solutions.

Key insights

Integrating AI, AR, and robotics with natural interaction methods creates innovative human-machine workflows for design and construction.

Principles

Method

Combine AR for spatial data capture with vision-language AI for contextual recommendations. Integrate gesture recognition and physics simulation for interactive design and robotic fabrication of structures.

In practice

Topics

Best for: Computer Vision Engineer, AI Scientist, Research Scientist, AI Engineer, AI Researcher, AI Product Manager

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by MIT News - Natural language processing.