💧 Instill VDP defines the Instill Format as the data type identifier,
ensuring consistency and compatibility across components. Each input field in a
component will define instillAcceptFormats
, specifying the data types the
component can accept. Conversely, each output field will define instillFormat
,
indicating the data type the component produces. The design of 💧 Instill
VDP reference system ensures that all input fields reference the correct data
types, maintaining the integrity and functionality of the data pipeline.
💧 Instill VDP extends the Instill Format from the JSON primitive types and MIME types (IANA media types).
JSON Primitive Types:
string
number
integer
boolean
object
array
MIME Types:
text/*
: All kinds of text data, such astext/html
image/*
: All kinds of image data, such asimage/jpeg
video/*
: All kinds of video data, such asvideo/h264
audio/*
: All kinds of audio data, such asaudio/wav
application/*
: Other types of data, such asapplication/pdf
Semi-structured Data Types
semi-structured/json
: JSON data, which can include any JSON primitive typessemi-structured/object
Structured Data Types
Structured data types are specifically designed for various advanced tasks, particularly in the fields of Large Language Model (LLM) and Computer Vision (CV).
structured/chat-message
: Used in LLM tasksstructured/multi-modal-content
: Used in LLM tasksstructured/bounding-box
: Used in CV tasksstructured/detection-object
: Used in CV tasksstructured/instance-segmentation-object
: Used in CV tasksstructured/keypoint-object
: Used in CV tasksstructured/ocr-object
: Used in CV tasksstructured/semantic-segmentation-stuff
: Used in CV tasks
By adhering to these formats, 💧 Instill VDP ensures accurate and efficient data processing, enabling users to construct robust and scalable unstructured data pipelines. The utilization of standardized data types also promotes seamless integration and interoperability among various components and systems within the 💧 Instill VDP.