A model accepting and/or producing multiple content types (text, images, audio, video). Multimodal capabilities expand privacy and IP risk through processing of faces, voices, and biometrics, and may trigger additional regulatory obligations.
Multimodal model
C
T
See: Biometric data; Computer vision; LLM