Multimodal AI

Models that handle text together with images, audio, or video, as input, output, or both.

From 10-AI-Concepts

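  • Deeper understanding: combining text and image signals gives the model more context than either modality alone.
  • Applications: whiteboard analysis, medical scans read together with clinical notes.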
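
One simple way to combine modalities is to concatenate a text feature vector and an image feature vector into a single joint vector that a downstream model can consume. The sketch below illustrates only that idea, under stated assumptions: the vectors are toy hand-written values standing in for real encoder outputs, and the names `l2_normalize` and `fuse_features` are hypothetical, not part of any library. Real multimodal models typically use learned encoders and more elaborate fusion.

```python
from typing import List


def l2_normalize(vec: List[float]) -> List[float]:
    """Scale a vector to unit length so neither modality dominates by magnitude."""
    norm = sum(x * x for x in vec) ** 0.5
    if norm == 0.0:
        return list(vec)  # leave an all-zero vector unchanged
    return [x / norm for x in vec]


def fuse_features(text_vec: List[float], image_vec: List[float]) -> List[float]:
    """Hypothetical helper: normalize each modality, then concatenate them."""
    return l2_normalize(text_vec) + l2_normalize(image_vec)


# Toy stand-ins for encoder outputs (assumed values, not real embeddings).
text_vec = [3.0, 4.0]
image_vec = [0.0, 2.0, 0.0]

joint = fuse_features(text_vec, image_vec)
print(len(joint))  # joint length is len(text_vec) + len(image_vec)
```

Normalizing before concatenation is a design choice in this sketch: it keeps one modality from dominating the joint vector purely because its raw values are larger.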