Multimodal AI: models that handle text plus images, audio, or video, as input and/or output. Source: 10-AI-Concepts. Combining modalities gives deeper understanding than any single one (e.g. cats: text + image). Apps: whiteboard analysis, medical scans + notes.
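A minimal sketch of what "text + image in" can look like as data, assuming a request is just a list of tagged parts. The names `Part`, `MODALITIES`, `is_multimodal` and the file name `whiteboard.jpg` are hypothetical, made up for illustration; this is not any particular model's API.

```python
from dataclasses import dataclass
from typing import List, Set

# Modalities named in the note above (hypothetical constant).
MODALITIES = {"text", "image", "audio", "video"}


@dataclass
class Part:
    """One piece of a request: a modality tag plus its content."""
    modality: str
    content: str  # the text itself, or a path/reference to a media file

    def __post_init__(self) -> None:
        # Reject tags outside the four modalities listed above.
        if self.modality not in MODALITIES:
            raise ValueError(f"unknown modality: {self.modality}")


def modalities_used(parts: List[Part]) -> Set[str]:
    """Return the distinct modalities present in a request."""
    return {p.modality for p in parts}


def is_multimodal(parts: List[Part]) -> bool:
    """A request counts as multimodal if it mixes two or more modalities."""
    return len(modalities_used(parts)) > 1


# Whiteboard-analysis style request: one image plus a text instruction.
request = [
    Part("image", "whiteboard.jpg"),  # hypothetical file name
    Part("text", "Summarize the diagram on this whiteboard."),
]
```

Here `is_multimodal(request)` is true because the request mixes an image with text, while a request holding only text parts would not count.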